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So-called  Measure-Correlate-Predict  (MCP)  methods  have  been  extensively  proposed  in  renewable 
energy  related  literature  to  estimate  the  wind  resources  that  represent  the  long-term  conditions  at  a 
target  site  where  a  short-term  wind  data  measurement  campaign  has  been  held. 

The  main  differences  between  the  various  MCP  methods  lie  fundamentally  in  the  type  of  relationship 
established  between  the  wind  data  (speed  and  direction)  recorded  at  the  target  site  and  the  wind  data 
recorded  simultaneously  at  one  or  various  nearby  weather  stations  which  serve  as  reference  stations  and 
for  which  long-term  data  series  are  also  available. 

This  paper  reviews  a  wide  range  of  MCP  methods  that  have  been  proposed  in  the  context  of  wind 
energy  analysis,  a  number  of  which  have  been  implemented  in  wind  energy  industry  software 
applications.  This  review  includes  the  initial  methods  first  proposed  in  the  1940s  which  generally 
attempted  only  to  estimate  the  long-term  mean  annual  wind  speed  from  a  single  reference  station,  and 
extends  up  to  the  most  recent  methods  proposed  in  the  present  century  based  on  automatic  learning 
techniques  which  use  several  reference  stations. 

In  addition  to  offering  a  description  of  the  linear,  non-linear  and  probabilistic  transfer  functions  used 
by  the  different  algorithms,  the  hypotheses  on  which  these  functions  are  based  and  the  data  format  with 
which  the  various  methods  work  (time  series  or  frequency  distributions),  this  review  will  also  cover 
limitations  in  the  use  of  MCP  methods,  the  uncertainty  associated  with  them  and  the  different  reference 
data  sources  that  have  been  studied.  In  this  sense,  the  extensive  collection  of  MCP  methods  which  is 
brought  together  and  reviewed  in  this  paper,  ranging  from  the  simplest  and  easiest-to-use  models  to  the 
most  complicated  computational  ones  which  require  specific  user  experience,  comprises  an  extremely 
useful  catalogue  when  it  comes  to  choosing  the  best  predictor  method. 
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1.  Introduction 

Due  to  the  interannual  variability  of  wind  speed  [1-3]  various 
authors  [2,5,6,9-16]  have  indicated  that  wind  data  measured  over 
just  a  few  years  are  insufficient  to  reflect  the  average  conditions 
present  over  the  lifetime  of  a  wind  project  (typically  20  years). 

To  estimate  the  long-term  wind  resource  at  a  candidate  site  a 
long  series  of  wind  data  needs  to  be  available  for  that  site.  There  is 
no  general  agreement  as  to  the  recommended  time  period  that 
needs  to  be  covered  by  such  wind  data  time  series  for  a  candidate 
site.  Some  authors  suggest  data  availability  is  required  for  a  three 
year  period  [12],  some  for  a  10  year  period  [6,13],  whilst  others 
state  that  20  or  even  30  years  are  required  to  correctly  characterise 
the  long-term  wind  resource  of  a  site  [2,10,14], 

As  a  result  of  the  expense  associated  with  carrying  out 
measurement  campaigns  when  no  historical  series  of  wind  data 
are  available  at  the  target  site  and  as  a  consequence  of  the  expense 
caused  by  the  delay  [4,15,16]  to  the  development  of  the  wind  farm, 
such  measurement  campaigns  tend  to  be  restricted  to  12  months 
or  less  [17],  It  is  highly  unlikely  that  such  a  short  measuring  period 
will  provide  results  that  reflect  the  long-term  conditions  [18], 

A  possible  alternative  approach  [13]  involves  linking  the 
candidate  site  with  one  or  more  nearby  sites  where  long  series 
of  wind  data  measurements  have  been  recorded.  Such  a  link  can 
be  achieved  by  using  both  physical  and  statistical  methods.  The 
latter  have  been  extensively  proposed  in  the  scientific  literature 
for  this  purpose  [2,4-5,7,9,11-17,19-22]  and  are  commonly 
known  as  Measure-Correlate-Predict  (MCP)  methods. 

3.3.  Basic  idea  behind  the  approach  of  MCP  methods 

The  aim  when  using  MCP  methods  is  to  perform  long-term 
hindcasting  of  the  wind  conditions  at  a  candidate  site  for  which 
only  short-term  wind  data  series  are  available.  This  hindcasting  is 
based  on  the  use  of  historical  and  homogenous  wind  data  series 
which  are  available  for  nearby  weather  stations  used  as  reference 
stations.  These  methods  also  require  part  of  the  long-term  data 
series  from  the  reference  stations  to  coincide  in  length  and  time 
period  with  the  short-term  data  recorded  at  what  is  variously 
described  in  the  scientific  literature  as  the  target  site,  candidate 
site  or  survey  site.  Given  the  higher  frequency  of  use  of  the  term 
‘target  site’,  this  description  will  be  used  hereafter  in  this  review. 

The  block  diagram  seen  in  Fig.  1  shows  the  procedure  normally 
employed  by  MCP  methods.  As  can  be  seen,  the  procedure 
basically  comprises  two  stages.  In  the  first  stage  (indicated  by  an 
encircled  number  “1"  in  Fig.  1),  using  as  a  starting  point  the  wind 


1  Numerical  wind  flow  models  based  on  theoretical  approaches  and  which  can 
be  classified  into  categories  of  different  degrees  of  complexity. 


data  series  recorded  at  the  reference  and  target  sites  for  the  short¬ 
term  period  that  is  common  to  both  (concurrent  data  period),  the 
aim  is  to  establish  a  relationship  between  the  two  sets  of  data. 
In  the  second  stage  (indicated  by  an  encircled  number  “2”  in 
Fig.  1 ),  the  long-term  wind  data  series  available  for  the  reference 
stations  are  applied  as  input  variables  to  the  relationship  estab¬ 
lished  in  stage  1  in  order  to  perform  long-term  hindcasting  of  the 
wind  conditions  at  the  target  site.  The  few  MCP  methods  which  do 
not  follow  the  procedure  outlined  in  Fig.  1  determine  the  relation¬ 
ship  between  the  short-  and  long-term  data  of  the  reference 
station  and  then  apply  it  to  the  short-term  data  available  for  the 
target  site  with  the  aim  of  obtaining  data  that  represent  its  long¬ 
term  wind  conditions. 


3.2.  Importance  of  MCP  methods  in  wind  resource  estimation 

According  to  Addison  et  al.  [22],  MCP  methods  tend  to  provide 
higher  accuracy  than  physical  modelling  methods  (for  example, 
WAsP2  [20,23]),  especially  in  complex  environments.  Physical 
models  also  introduce  non-quantifiable  uncertainties  in  the  pre¬ 
diction.  For  these  reasons,  MCP  methods  have  evolved  to  become  a 
standard  tool  for  wind  farm  developers  and  have  been  implemen¬ 
ted  in  wind  energy  industry  software  applications  [24-30]  for 
long-term  wind  speed  forecasts.  As  reported  in  a  study  by  Derrick 
et  al.  [31]  in  1996,  almost  all  the  consultants  who  have  carried  out 
measurements  to  assess  the  wind  regime  at  a  site  have  used  some 
form  of  measure-correlate-predict  strategy  to  forecast  the  long¬ 
term  wind  regime.  Sanz  [32]  presents  the  results  from  a  wind 
resource  assessment  questionnaire  called  WAUDIT3  which  had 
been  sent  during  the  first  half  of  2010  to  wind  analysts  ranging 
from  industrial  to  academic  sectors  at  European  organisations 
operating  in  the  wind  energy  field.  The  purpose  of  the  question¬ 
naire  was  to  construct  a  knowledge  map  for  wind  resource 
assessment  methods  in  the  industry.  A  total  of  72  wind  analysts 
from  48  different  organisations  and  13  European  countries  con¬ 
tributed  to  the  survey.  To  the  surprise  of  the  author  of  the  report, 
despite  the  importance  of  MCP  methods  for  wind  energy  devel¬ 
opers,  only  70%  of  all  respondents  (80%  in  the  case  of  consultants) 
said  that  they  used  these  methods. 


2  WAsP  (Wind  Atlas  Analysis  and  Application  Program)  is  a  model  developed 
by  the  Wind  Energy  Department  at  Rise  National  Laboratory  for  wind  resource 
assessment.  WAsP  is  probably  the  most  widely  used  numerical  wind  flow  model  in 
the  wind  energy  industry. 

3  WAUDIT  is  an  Initial  Training  Network  (1TN)  Marie-Curie  action  financed  by 
the  FP7-PEOPLE  programme  (Project  Reference:238576)  of  the  European  Commis¬ 
sion.  Under  the  subject  of  “Wind  Resource  Assessment  Audit  and  Standardization”, 
the  aim  of  WAUDIT  is  to  set  up  a  group  of  researchers  in  the  field  of  wind  resource 
assessment,  for  the  purpose  of  supporting  the  technological  development  of  wind 
energy  as  one  of  the  world’s  fastest  growing  industries. 
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A  orthogonal  matrix  of  L  rows  and  L  columns,  formed  by 

eigenvectors  of  the  matrix  product  of  V(L  x  M)  and  its 
transpose  VT(M  x  L); 

A  parameter  defined  by  Eq.  (76); 

aik  element  of  matrix  A,  Eq.  (94); 

a0,ak  parameters  of  Eq.  (5); 

ANNs  artificial  neural  networks; 

AR  matrix  where  each  element  ar,  ,  represents  the  average 
ratio  corresponding  to  wind  speed  bin  i  of  the  wind 
direction  sector  j  of  the  reference  site; 
ary  element  of  the  matrix  AR,  Eqs.  (68)-(70); 

ASOS  automated  surface  observing  system; 

B  matrix  in  which  an  element  fay,  represents  the  wind 

speed  bin  i  and  wind  direction  sector  j  of  the 
reference  site; 

fa  wind  speed  bin,  Eq.  (29); 

bj  Bin  i  of  the  wind  speed,  Eq.  (18); 

fay  element  of  the  matrix  B  which  contains  the  set  of 

ratios  calculated  between  each  target  site  wind  speed, 
(Vk)7’  which  is  paired  by  date  and  time  k  with  the 
corresponding  reference  site  wind  speed,  ( vk)fT ; 
bk  parameter  of  Eq.  (5); 

BMARS  Bayesian  multivariate  adaptive  regression  spline; 

BNs  Bayesian  networks; 

C  matrix  of  coefficients  Cy-; 

C  Weibull  scale  parameter; 

CDF  cumulative  distribution  function; 

CFSR  climate  forecast  system  reanalysis; 

Cy  coefficients,  Eq.  (80),  of  matrix  C,  which  assuming  that 
they  remain  constant  over  the  long-term,  are  used  to 
estimate  the  long-term  relative  frequencies  of  occur¬ 
rence  of  category  i  at  the  reference  site  and  of  category 
j  at  the  target  site,  Eqs.  (80)  and  (81); 

CNF  normalization  factor,  Eqs.  (86)-(89); 

CPMF  cumulative  probability  mass  function; 

CPMFj  cumulative  probability  mass  function  of  the  target  site 
wind  speed  for  bin  j  of  the  reference  site; 

CSQ  Chi-square  regression; 

c(k)  the  simple  autocovariance  function,  Eqs.  (2)  and  (3); 
C'T,  Cf7  short-term  Weibull  scale  parameters  of  the  reference 
and  target  site,  respectively,  Eqs.  (65)  and  (67); 
d  variable  representing  wind  direction, 

Eq.  (35); 

ddkz  represents  the  distance  between  station  k  and  station 
z,  Eq.  (101); 

(dj)f:T,(dk)rT  long-term  reference  site  wind  directions; 

(dj) rT ,(dk)fT  short-term  reference  site  wind  directions,  Eq.  (73); 

(dk) 7  short-term  target  site  wind  directions,  Eq.  (73); 

E  square  matrix  of  N  x  N  wind  direction  sectors,  Table  1 ; 

E'  square  matrix  ofNxN  wind  direction  sectors  whose 

elements  are  null  or  coincide  with  those  of  matrix  E  if 
the  restriction  imposed  by  Eq.  (52)  is  met,  Table  2; 
ECMWF  European  centre  for  medium-range  weather  forecasts; 
ey  Element  of  matrix  E  which  contains  the  number  of 
times  the  wind  has  blown  simultaneously  in  target 
site  sector  i  and  reference  site  sector  j,  Eq.  (52); 
e'y  element  of  the  matrix  E',  Eqs.  (52)  and  (53); 

EWEA  European  wind  energy  association; 

FFT  fast  Fourier  transform; 


(F j)rT,  (Fj)[T  long-term  frequencies  of  occurrence  of  category  i 
at  the  reference  site  and  of  category  j  at  the  target  site, 
Eq.  (81); 

(Fi)srT,  (Fj)^T  short-term  frequencies  of  occurrence  of  category  i 
at  the  reference  site  and  of  category  j  at  the  target  site, 
Eq.  (79); 

long-term  frequency  of  wind  speed  bin  i  and  wind 
direction  sector  j  at  the  reference  site,  Eqs.  (17)  and  (18); 
short-term  frequency  of  wind  speed  bin  i  and  wind 
direction  sector  j  at  the  reference  site,  Eqs.  (17)  and 

(18); 

long-term  wind  speed  frequency  in  wind  direction 
sector  j  of  the  reference  site,  Eqs.  (17)— (19); 
short-term  wind  speed  frequency  in  wind  direction 
sector  j  of  the  reference  site,  Eqs.  (17)— (19); 
frequency  of  occurrence  of  WPT  j  at  the  target  site  t, 
Eq.  (13); 

long-term  wind  speed  probability  density  function  of 
the  reference  site,  Eq.  (86); 

long-term  wind  speed  frequency  at  the  reference  site 
for  direction  sector  0r,  Eq.  (92); 
short-term  wind  speed  frequency  at  the  reference  site 
for  direction  sector  6r,  Eq.  (92); 

long-term  wind  speed  probability  density  function  of 
the  target  site,  Eq.  (86); 

long-term  wind  speed  frequency  at  the  target  site  for 
direction  sector  6r,  Eq.  (92); 

short-term  wind  speed  probability  density  functions 
of  the  target  site,  Eq.  (83); 

wind  speed  probability  density  function  at  the  target 
site  conditioned  on  the  reference  site  wind  speed,  Eq. 

(83); 

short-term  wind  speed  frequency  at  the  target  site  for 
direction  sector  0r,  Eq.  (92); 

F^T (vt),F^T (vr)  cumulative  distribution  functions  of  wind  speed 
at  the  target  and  reference  site,  respectively,  Eqs.  (83)- 

(85); 

F(vt,vr )  bivariate  Weibull  distribution  function,  Eq.  (90); 

/[■]  symbol  representing  a  first-order  linear  function,  Eq. 

(5); 

G  matrix  where  each  element  or  cell,  gjj,  corresponds  to 

a  wind  speed  bin  i  (with  default  width  of  1  m/s)  and 
wind  direction  sector  j  (default  value  of  30°)  of  the 
reference  site; 

GCM  general  circulation  model; 

gij  element  of  the  matrix  G; 

GOR  general  orthogonal  regression  method; 

H  square  matrix  (NfaxNfa)  which  constitutes  a  joint 

relative  frequency  distribution  of  short-term  wind 
speed,  wind  direction  and  atmospheric  stability; 

hfj  element  of  matrix  H  which  represents  the  short-term 
relative  frequency  of  occurrence  of  category  i  at  the 
reference  site  and  category  j  at  the  target  site,  Eqs.  (79) 
and  (80); 

ITC  installations  of  the  Canary  Technological  Institute 

(Spanish  initials); 

1TN  initial  training  network; 

I<  Weibull  shape  parameter; 

Kr  number  of  time  lags  considered,  Eqs.  (2)  and  (3); 

l<srT,  I<stT  short-term  Weibull  shape  parameters  of  the  reference 
site  and  target  site,  respectively,  Eqs.  (65)  and  (67); 

L  number  of  data  of  a  wind  speed  time  series,  Eq.  (94); 

LT  long-term; 


(fij)rT 

(fj)rT 

(fjf 

(FOj)t 

fr\v) 

fr\0  r) 

r) 

/f(V) 

/tVr) 

/f'(Vt) 

/f(Vt|Vr) 

ff'(A) 
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M  number  of  parameters, aky  bk,  in  Eq.  (5).  Span  of  data, 
Eqs.  (31)  and  (32).  Number  of  meteorological  stations, 
Eq.  (94); 

MCP  measure-correlate-predict; 

MCRT  multiple  climatic  reduction  technique; 

MERRA  modern  era  retrospective-analysis  for  research  and 
applications; 

MMCP  multivariate  MCP  methodology; 

MSD  moulded  site  data  method  or  windscale  method; 

MTS  matrix  time  series  method; 

N  number  of  wind  direction  sectors  under  consideration, 

Eq.  (52); 

n  number  of  wind  speed  data  under  consideration,  Eqs. 

(1),  (3),  (4),  (23),  (27),  (31),  (32),  and  (40); 

Na  number  of  eigenvectors  that  need  to  be  chosen  such 
that  Eq.  (95)  conserves  a  significant  percentage  of  the 
variance  expressed  by  the  trace  of  WT. 
Na<  minimum  [L,M],  Eqs.  (95)-(97); 

NB  number  of  wind  speed  bins,  Eqs.  (17)  and  (18); 

NCAR  National  Centre  for  Atmospheric  Research; 

NCEP  National  Centres  for  Environmental  Prediction; 

Nh  number  of  columns  (or  rows)  of  matrix  H,  Eq.  (79); 

riij  number  of  ratios  stored  in  the  element  by  of  matrix  B, 

Eqs.  (68)  and  (69); 

NP  order  of  the  polynomial,  Eq.  (74); 

Nw,  Nd  numbers  of  wind  speed  and  wind  direction  bins  of  the 
joint  probability  mass  function,  Eq.  (82); 

NWP  numerical  weather  prediction; 

OLR  ordinary  linear  regression; 

OR  orthogonal  regression; 

Pdf  probability  density  function; 

PMF  probability  mass  function; 

PMFj  probability  mass  function  of  the  target  site  wind  speed 

conditioned  on  the  reference  site  wind  speed  of  bin  j; 
p(7(v,  d)  probability  mass  function  of  the  long-term  reference 
site  wind  speed  and  direction,  Eq.  (82); 
pf-T(v,  d)  probability  mass  function  of  the  long-term  target  site 
wind  speed  and  direction,  Eq.  (82); 
pfLr(Vt,dt,  vr,dr)  joint  probability  mass  function  of  the  short¬ 
term  wind  speeds  (vt,vr)  and  directions  (dt,dr)  of  the 
target  (t)  and  reference  (r)  sites,  Eq.  (82); 
p^T(Vi,dj\vk,dz)  probability  density  function  of  the  target  site 
wind  speed  conditioned  on  the  reference  site  wind 
speed,  Eqs.  (88)  and  (89); 

pLT[(d,jt]  long-term  probability  that  the  wind  blows  in  the 
direction  corresponding  to  sector  j  of  the  target  site 
(t),  Eqs.  (9),  (10),  and  (12); 

pLT[(0k)r]  long-term  probability  that  the  wind  blows  in  the 
direction  corresponding  to  sector  k  of  the  reference 
site  (r),  Eqs.  (9),  (10)  and  (12); 

psl[(Oj)t\(0k)r]  short-term  probability  that  the  wind  blows  in  the 
direction  corresponding  to  sector  j  of  the  target  site 
(t),  conditioned  on  the  short-term  probability  that  the 
wind  blows  in  the  direction  corresponding  to  sector  k 
of  the  reference  site  (r),  Eqs.  (9),  (10),  and  (12); 
pST[  |-]  observed  conditional  probability  at  the  target  site 
during  the  short-term  measurement  period; 

QR  quantile  regression; 

qJ  transpose  of  the  column  vector,  z=t,  of  the  square 
matrix  R,  Eq.  (98); 

R  square  matrix  of  M  rows  and  M  columns,  whose 

elements  are  defined  by  Eq.  (101); 
r  Pearson  product-moment  coefficient,  Eq.  (1 ).  -l<r<l ; 


Rfj  Ratio  between  the  long-  and  short-term  wind  fre¬ 
quency  of  wind  direction  sector  j  of  the  reference  site, 
Eqs.  (19)  and  (20); 

RES  Renewable  Energy  Systems  Ltd. 

rkz  correlation  coefficient  between  the  data  of  station  k 

and  of  station  z,  Eq.  (101); 

r(k )  cross-correlation  coefficient  as  a  function  of  time  lag  k, 
Eq.  (2); 

rs  Spearman's  correlation  coefficient; 

rLT  correlation  coefficient  between  the  long-term  wind 

speeds  measured  at  the  reference  and  target  sites,  Eq. 

(7); 

Rvj  Ratio  between  the  long-  and  short-term  mean  wind 
speed  in  wind  direction  sector  j  of  the  reference  site, 
Eqs.  (19)  and  (20); 

[rST](flt)r  correlation  coefficient  between  the  short-term  wind 
speeds  measured  at  the  reference  and  target  sites 
when  the  reference  site  wind  blows  in  the  direction 
corresponding  to  sector  k,  Eq.  (11); 
rAvACj  correlation  coefficient  between  the  variables  Av  and 
Ad,  defined  in  Eqs.  (72)  and  (73); 
rl,r2  reference  stations  1  and  2,  Eq.  (93); 

R2  coefficient  of  determination,  Eq.  (4); 

SD  matrix  in  which  each  element,  sdy,  represents  the 
standard  deviation  of  the  ratios  of  wind  speed  bin  i 
and  wind  direction  sector  j  of  the  reference  site; 

(s i)fT  standard  deviation  of  the  ith  short-term  reference  site 

wind  speed,  Eq.  (27); 

(s,)f  standard  deviation  of  the  ith  short-term  target  site 
wind  speed,  Eq.  (27); 

sdij  element  of  matrix  SD,  Eqs.  (69)  and  (71); 

SOR  simple  orthogonal  regression  method; 

SPjj  estimated  statistical  property  (mean  standard  devia¬ 
tion,  correlation  coefficient)  in  cell  gifj  of  matrix  G, 
Eq.  (74); 

sjT  standard  deviation  of  observed  long-term  wind 

speeds  at  the  reference  site,  Eq.  (7); 
sf  standard  deviation  of  observed  short-term  wind 

speeds  at  the  reference  site,  Eqs.  (2)  and  (34); 
slt~st  standard  deviation  of  the  long-term  (LT)  mean  wind 
speeds  at  the  reference  site  averaged  over  periods  as 
long  as  the  short-term  (ST)  target  period,  Eq.  (8); 
[s?T](ao  standard  deviation  of  the  short-term  wind  speeds 
recorded  at  the  reference  site  when  the  wind  at  that 
site  blows  in  the  direction  corresponding  to  sector  k, 
Eq.  (11); 

ST  short-term; 

s[T  standard  deviation  of  observed  long-term  wind 

speeds  at  the  target  site,  Eqs.  (7)  and  (8); 

s;?T  standard  deviation  of  observed  short-term  wind 

speeds  at  the  target  site,  Eqs.  (2),  (8)  and  (34); 

[s?%t)r  standard  deviation  of  the  short-term  wind  speeds 
recorded  at  the  target  site  when  the  wind  at  the 
reference  site  blows  in  the  direction  corresponding 
to  sector  k,  Eq.  (11); 

sVrVt  covariance  between  the  (Vj)fT  and  the  (Vj)fT,  Eqs.  (23) 
and  (25).  Covariance  of  the  target  and  reference  site 
wind  speeds,  Eq.  (44); 

sVr  standard  deviation  of  the  reference  site  wind  speed, 

Eqs.  (42),  (46),  (48)  and  (49); 

sVt  standard  deviation  of  the  target  site  wind  speed, 

Eqs.  (43),  (46),  (48)  and  (49); 

s2r  variance  of  the  wind  speeds  (v,j)T,  Eqs.  (23),  (25) 

and  (42); 
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s2Vt  variance  of  the  (Vj)fT,  Eqs.  (25)  and  (43); 

s(Vx)r  standard  deviation  of  the  X-axis  components  of  the 
reference  site  wind  speeds,  Eq.  (42); 
s(v*)r(v*)t  covariance  of  the  X-axis  components  of  the  target  and 
reference  site  wind  speed,  Eqs.  (44)  and  (48); 
s(Vx)t  standard  deviation  of  the  X-axis  components  of  the 
target  site  wind  speeds,  Eq.  (43); 

S(vx)r(v,)t  covariance  of  the  X-axis  components  of  the  reference 
site  wind  speed  and  the  y-axis  components  of  the 
target  site  wind  speed,  Eqs.  (44)  and  (49); 
s(Vj)r  standard  deviation  of  the  y-axis  components  of  the 
reference  site  wind  speeds,  Eq.  (42); 
s(vy)r(v,)t  covariance  of  the  y-axis  components  of  the  reference 
site  wind  speed  and  the  X-axis  components  of  the 
target  site  wind  speed,  Eqs.  (44)  and  (49); 
s(Vy) t  standard  deviation  of  the  y-axis  components  of  the 

target  site  wind  speeds,  Eq.  (43); 

S(vj,)r(vy)t  covariance  of  the  y-axis  components  of  the  target  and 
reference  site  wind  speed,  Eqs.  (44)  and  (48); 

S1,S2  parameters  defined  by  Eqs.  (84)  and  (85); 

Sav,Saci  standard  deviations  of  the  variables  Av  and  Ad  defined 
in  Eqs.  (72)  and  (73); 

s2Vt  variance  of  the  errors  of  the  dependent  variable 
(target  site  wind  speed),  Eq.  (24); 
s2Vr  variance  of  the  errors  of  the  independent  variable 
(reference  site  wind  speed),  Eq.  (24); 

(tk)fT  scaled  time  interval  of  long-term  target  site  wind  data, 
Eq.  (20); 

(th)^  time  interval  of  short-term  target  site  wind  data 
measurement,  Eq.  (20); 

11,12  ends  of  the  interval  of  available  wind  data  of  a  time 
series.  T1>1, 72 <L, 12-71  +  1  <N,  Eqs.  (96),  (97)  and 
(99); 

U  matrix  in  which  an  element  Uy,  contains  the  number 

of  times  that,  during  the  concurrent  data  period, 
target  site  wind  speeds  of  bin  i  were  recorded  and 
reference  site  wind  speeds  of  bin  j  were  recorded; 

U  Random  value  extracted  from  a  uniform  distribution 

in  the  interval  (0,1),  Eq.  (71); 

Uij  element  of  the  matrix  U; 

V  matrix  whose  columns  are  time  series  with  L  data 

from  M  weather  stations; 

v  variable  representing  wind  speed,  Eq.  (35); 

vcut-off  reference-site  cut-off  speed,  Eq.  (30); 

Vij  element  of  matrix  V,  Eq.  (94); 

(Vi)yT,(Vj)yT  short-term  wind  speeds  observed  at  the  reference 
site,  Eqs.  (23),  (27),  (31)  and  (33); 

(vi)'tr(Vj)ll:l  ,(vk)l[r  long-term  wind  speeds  estimated  at  the  target 
site,  Eqs.  (14),  (16),  (20),  (21),  (29),  (30),  (33),  (34), 
(45),  (55); 

(Vj )fT,(vfc )fr,(Vj  )fr  short-term  wind  speeds  observed  at  the  tar¬ 
get  site,  Eqs.  (16),  (22),  (23),  (27),  (31)  and  (33); 
vit  target  station  wind  speed  (t),  Eqs.  (96)  and  (97); 

(Vj)^T  long-term  wind  speeds  observed  at  the  reference  site, 
Eqs.  (14),  (21),  (29),  (30),  (33)  and  (34); 

[(Vj)rT](ek>  measured  long-term  wind  speed  at  the  reference  site 
in  function  of  wind  direction  sector  k  at  the  reference 
site,  Eq.  (15); 

J  estimated  long-term  wind  speed  at  the  target  site 
<Sl)in  function  of  wind  direction  sector  k  at  the  reference 
site,  Eq.  (15); 

[(Vjlxlr1  component  X  of  the  long-term  reference  site  wind 
speed,  Eqs.  (36),  (39)  and  (45); 


[(Vj-yf  component  X  of  the  long-term  target  site  wind  speed, 
Eqs.  (36),  (38),  (39), (45)  and  (102); 

[(vj)y]rT  component  y  of  the  long-term  reference  site  wind 
speed,  Eqs.  (37),  (39)  and  (45); 

[(Vj)y]fT  component  Y  of  the  long-term  target  site  wind  speed, 
Eqs.  (37)-(39),  (45),  and  (102); 

Vmax  maximum  wind  speed,  Eq.  (62); 
vr  variable  representing  the  reference  site  wind  speeds, 

Eq.  (41); 

vst  vlt  vectors  whose  elements,  (v^T, ...,  vf7,  ...v^)7  and 
(v77, ...,  vf7,  ...v^)7  are  the  means  of  the  square  roots 
of  the  short-term,  Eq.  (99),  and  long-term,  Eq.  (100), 
wind  speeds,  respectively,  calculated  with  the 
observed  data  of  each  of  the  M  stations  under 
consideration; 

vt  variable  representing  the  target  site  wind  speeds,  Eq. 

(41); 

(vtt)^r  reference  site  wind  speed  which  is  paired  by  date  and 
time  ft  with  the  corresponding  target  site  wind  speed, 
(Vtt)f ; 

(Vtt)tT  target  site  wind  speed  which  is  paired  by  date  and 
time  tt  with  the  corresponding  reference  site  wind 
speed,  (vtt)f; 

vx  variable  representing  the  X-axis  components  of  the 

wind  speed,  Eq.  (35); 

( vx) r  variable  representing  the  X-axis  components  of  the 

reference  site  wind  speed,  Eq.  (41); 

(v*)t  variable  representing  the  X-axis  components  of  the 
target  site  wind  speed,  Eq.  (41); 
vy  variable  representing  the  y-axis  components  of  the 

wind  speed,  Eq.  (35); 

(vy) r  variable  representing  the  y-axis  components  of  the 

reference  site  wind  speed,  Eq.  (41); 

(vy)t  variable  representing  the  y-axis  components  of  the 
target  site  wind  speed,  Eq.  (41); 

(v6)f  mean  short-term  wind  speed  of  bin  b  at  the  reference 
site,  Eq.  (29); 

(v6)f  mean  short-term  wind  speed  of  bin  b  at  the  target  site, 
Eq.  (29); 

(v,)77  estimated  long-term  mean  wind  speeds  in  the  target 
site  wind  direction  sector  i,  Eqs.  (55)  and  (56); 

(v,)77  long-term  mean  wind  speeds  in  the  reference  site 
wind  direction  sector  j,  Eqs.  (18)  and  (19); 

(v,)77  measured  long-term  mean  wind  speeds  in  the  refer¬ 
ence  site  wind  direction  sector  j,  Eqs.  (55)  and  (56); 
(Vj)yT  short-term  mean  wind  speeds  in  the  reference  site 
wind  direction  sector  j,  Eqs.  (18)  and  (19); 

(vx)^r  short-term  mean  wind  speed  of  the  X-axis  wind 
component  at  the  reference  site,  Eqs.  (50)  and  (51); 

(vx) ,r  short-term  mean  wind  speed  of  the  X-axis  wind 

component  at  the  target  site,  Eq.  (50); 

(vy) yT  short-term  mean  wind  speed  of  the  y-axis  wind 

component  at  the  reference  site,  Eqs.  (50)  and  (51); 
(Vy)f7  short-term  mean  wind  speed  of  the  y-axis  wind 
component  at  the  target  site,  Eq.  (51); 
v|T  long-term  mean  wind  speed  measured  at  the  refer¬ 
ence  site,  Eqs.  (6),  (7)  and  (16); 

tfT)j  mean  observed  long-term  wind  speeds  for  WPT  j  at 
the  reference  site,  Eq.  (13); 

Of),  mean  observed  short-term  wind  speeds  for  WPT  j  at 
the  reference  site,  Eq.  (13); 

[vj:7](0|,  (  measured  long-term  mean  wind  speed  in  wind  direc¬ 
tion  sector  k  of  the  reference  site,  Eqs.  (9)  and  (11); 
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vj1  short-term  mean  wind  speed  measured  at  the  refer¬ 
ence  site,  Eqs.  (1),  (3),  (6),  (7),  (14),  (16)  and  (34); 
[vf]w  measured  short-term  mean  wind  speed  of  the  refer¬ 
ence  site  when  the  reference  site  wind  speed  blows  in 
wind  direction  sector  k,  Eqs.  (9),  (11)  and  (15); 
vjr  long-term  mean  wind  speed  estimated  the  target  site, 
Eqs.  (6),  (7)  and  (13); 

[vf](e,t  estimated  long-term  mean  wind  speed  in  wind  direc¬ 
tion  sector  j  of  the  target  site  (t),  Eqs.  (9),  (10)  and 

(12); 

[vf](fltV  estimated  long-term  wind  speed  at  the  target  site  for 
each  given  wind  direction  sector  k  at  the  reference 
site,  Eqs.  (11)  and  (12); 

vf  short-term  mean  wind  speed  measured  at  the  target 
site,  Eqs.  (1),  (3),  (4),  (6),  (7), (14)  and  (34); 

0f)j  mean  observed  short-term  wind  speeds  for  WPT  j  at 
the  target  site,  Eq.  (13); 

[v?](ok)  measured  short-term  mean  wind  speed  of  the  target 
site  when  the  reference  site  wind  speed  blows  in  wind 
direction  sector  k,  Eqs.  (9),  (11)  and  (15); 

(v'jdr7  short-term  reference  site  wind  speeds  smoothed  out 
by  use  of  the  concept  of  moving  averages,  Eqs.  (31) 
and  (32); 

(v'klf7  short-term  target  site  wind  speeds  smoothed  out  by 
use  of  the  concept  of  moving  averages,  Eqs.  (31)  and 
(32); 

(vj)f,  ,(Vj)f7  estimated  short-term  wind  speeds  of  the  target  site, 
Eqs.  (4)  and  (5); 

W  Matrix  of  L  rows  and  M  columns,  whose  elements 

correspond  to  data  projections  of  each  station  onto 
each  row  of  A  (L  x  L); 

WAsP  Wind  Atlas  Analysis  and  Application  Program; 
WAUD1T  an  ITN  Marie-Curie  action  financed  by  the  FP7- 
PEOPLE  programme  (Project  Reference:  238576)  of 
the  European  Commission; 

Weka  Waikato  Environment  for  Knowledge  Analysis; 

WF  windiness  Factor  method; 

Wk,j  element  of  matrix  W,  Eq.  (94); 

wk,t  element  of  matrix  W,  corresponding  to  column  j=t.  ‘f 
represents  the  target  station,  Eqs.  (96)  and  (97); 

WLS  weighted  total  least  squares; 

WPT  weather  pattern  type; 

Z  square  matrix  of  NxN,  whose  elements  are  expressed 

as  a  percentage  of  the  total  number  data  of  each  target 
station  sector  i,  such  that  for  each  row  (sector)  i  Eq. 
(54)  is  met,  Table  1; 

Zij  element  of  matrix  Z,  Eqs.  (53),  (55)  and  (56). 

Greek  letters 

a  offset  of  the  regression  line,  Eqs.  (21)-(23),  (27),  (30), 

(57), -(59),  (61),  (66),  (67),  (74)  and  (93); 
ax  offset  of  the  regression  line  of  the  X-axis  components 

of  the  wind  speeds,  Eqs.  (36),  (45)  and  (50); 
ay  offset  of  the  regression  line  of  the  Y-axis  components 

of  the  wind  speeds,  Eqs.  (37),  (45)  and  (51); 
az  offset  of  the  straight  line  for  the  quantile  t,  Eq.  (28); 

cti,  a2  offsets  of  the  vector  method,  Eqs.  (39)  and  (40); 

p  slope  of  the  regression  line,  Eqs.  (21),  (22),  (23),  (27), 

(30),  (33),  (66)  and  (67); 

Pij  slopes  of  the  vector  method,  Eqs.  (39)  and  (40); 

Pj,  Pi,  aj  slope  and  offset,  respectively,  at  the  origin  of  the 
straight  lines,  Eqs.  (55)  and  (56); 


pkk  polynomial  fit  parameters,  Eq.  (74); 

px  slope  of  the  regression  line  of  the  X-axis  components 

of  the  wind  speeds,  Eqs.  (36),  (45),  (46),  (50)  and  (51); 
py  slope  of  the  regression  line  of  the  Y-axis  components 

of  the  wind  speeds,  Eqs.  (37),  (45),  (47),  (50)  and  (51); 
/?i  slope  of  the  regression  line,  Eqs.  (57)-(61)  and  (93); 

p2  slope  of  the  regression  line,  Eqs.  (30),  (57)  and  (93); 

/13  slope  of  the  regression  line,  Eq.  (32); 

/14  coefficient  of  interaction,  Eqs.  (57)  and  (59); 

ps  quadratic  coefficient,  Eqs.  (58),  (60)  and  (61); 

Pe  cubic  coefficient,  Eq.  (61); 

/hi,  Pn.  A21,  P22  slopes  of  the  vector  method,  Eq.  (40); 
pr  slope  of  the  straight  line  for  the  quantile  r,  Eq.  (28); 

y  Weibull  location  parameter;  parameter  of  the  equa¬ 

tion  of  fit  between  the  data  rkfZ  and  ddkz,  Eq.  (101); 
yfT,yfT  short-term  Weibull  location  parameters  of  the  refer¬ 
ence  and  target  site,  respectively,  Eq.  (65); 

A Tr  temperature  gradient  or  difference  between  tempera¬ 

tures  of  two  heights  at  the  reference  site,  Eq.  (59); 

Av  variable  representing  the  differences  between  wind 
speeds  (speed-up)  at  the  target  and  reference  sites,  Eq. 
(75); 

Av*.  differences  between  wind  speeds  (speed-up)  at  the 
target  and  reference  sites,  Eq.  (72); 

Avlt  variable  representing  long-term  differences  between 
wind  speeds  (speed-up)  at  the  target  and  reference 
sites,  Eq.  (77); 

Ad  variable  representing  the  differences  between  wind 
direction  (wind  veer)  at  the  target  and  reference  sites; 
Adk  differences  between  wind  direction  (wind  veer)  at  the 
target  and  reference  sites,  Eq.  (73); 

AdLT  variable  representing  long-term  differences  between 
wind  direction  (wind  veer)  at  the  target  and  reference 
sites,  Eq.  (78); 

S  Small  fraction  of  the  total  number  of  data  recorded  in 

each  wind  direction  sector  which  indicates  the  ele¬ 
ments  or  bins  which  must  be  rejected,  Eq.  (52). 
Parameter  of  Eq.  (63).  Weibull  shape  factors  ratio, 
Eqs.  (66)  and  (67); 

£j,  £\  white  noise,  Eqs.  (21),  (30),  (58)-(61),  (74)  and  (93); 
ejj  random  variable  with  symmetric  triangular  distribu¬ 

tion,  Eqs.  (70)  and  (71); 
eft  mean  square  errors,  Eq.  (96); 

eXj  error  associated  with  the  X-axis  component  of  the 
target  site  wind  speed,  Eq.  (36); 

£yi  error  associated  with  the  Y-axis  component  of  the 
target  site  wind  speed,  Eq.  (37); 
evr  error  associated  with  the  reference  site  wind  speed 
(independent  variable),  Eqs.  (24)  and  (25); 

£Vt  error  associated  with  the  target  site  wind  speed 
(dependent  variable),  Eqs.  (24)  and  (25); 
parameter  of  the  equation  of  fit  between  the  data  rk,z 
and  ddkz,  Eq.  (101);  parameter  of  Eqs.  (62)-(64); 

(di)^  short-term  reference  site  wind  direction  sectors, 
Eq.  (5); 

(0j)t  wind  direction  sector  j  of  the  target  station  (t),  Eq.  (9); 

(0k)r  wind  direction  sector  k  of  the  reference  station  (r), 

Eq.  (9); 

6r  reference  site  wind  direction  sector; 

X  Parameter  which  controls  the  degree  of  association 

between  the  variables  vs  and  vr,  Eq.  (90).  0<2<1. 
Represents  scale  parameter  C  or  shape  parameter  K  of 
the  Weibull  distribution,  Eq.  (91).  Ratio  between  the 
variances  of  the  errors  of  the  dependent  variable 
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(target  site  wind  speed)  and  independent  variable 
(reference  site  wind  speed),  Eqs.  (24)  and  (25).  Ratio 
between  the  sum  of  the  variances  of  the  errors  of  the 
dependent  variable  and  of  the  equation  and  the 
variance  of  the  independent  variable,  Eq.  (26); 

^Avd'Ad  mean  of  the  variables  Av  and  Ad,  Eq.  (75); 

Ov  v  )x  real  component  of  the  correlation  vector,  Eqs.  (46)  and 
(48); 


(Pvrv,)y  imaginary  component  of  the  correlation  vector,  Eqs. 
(47)  and  (49); 

pVrVt  correlation  vector  between  target  and  reference  site 
wind  speeds; 

aEEq  error  of  the  equation,  Eq.  (26); 
r  quantile  (0<r<l),  Eq.  (28); 

X2  chi-square  statistic,  Eq.  (27); 

y/  coefficient  of  association,  Eqs.  (83)-(85). 


Landberg  and  Mortensen  [33]  published  a  study  in  1993  which, 
they  declared,  aimed  to  throw  some  light  on  the  on-going 
argument  between  supporters  of  physical  methods  for  wind 
resource  estimation,  such  as  WAsP,  and  supporters  of  statistical 
methods  like  MCP.  According  to  the  authors,  the  study  shows  that 
both  WAsP  and  MCP  methods  are  able  to  make  useful  forecasts  of 
long-term  wind  conditions  at  the  selected  sites  of  the  target  area. 
They  also  state  that  WAsP  functions  quite  well,  despite  violation  of 
the  assumptions  underlying  the  flow  model.  Bowen  and  Morten¬ 
sen  [34],  however,  in  a  study  they  performed  on  WAsP  prediction 
errors  as  a  result  of  site  orography,  conclude  that  these  errors  may 
be  significant  if  they  exceed  the  performance  envelopes  of  WAsP 
for  the  climate  and  terrain.  In  fact,  according  to  Brower  [9J,  WAsP 
failed  when  it  was  used  for  the  first  time  in  coastal  mountain 
passes  in  the  US  (where  the  first  wind  farms  of  that  countiy  were 
constructed),  as  it  was  unable  to  accurately  predict  the  wind 
resource  distribution.  Wind  flow  modelling  was  held  in  low 
esteem  in  US  wind  resource  assessment  circles  for  many  years  as 
a  result  of  this  experience.  Nevertheless,  Brower  [9]  admits  that 
WAsP  remains  popular  despite  its  known  limitations.  He  considers 
this  to  be  partly  due  to  the  fact  that  many  wind  project  sites  do  not 
have  steep  terrain  or  significant  mesoscale  circulations. 

Prasad  and  Bansal  [16],  using  data  compiled  from  various 
sources,  constructed  a  table  showing  wind  speed  uncertainty 


depending  on  the  approach  used  to  estimate  annual  mean  wind 
speed  and  mean  wind  speed  uncertainty  depending  on  the 
duration  of  the  wind  data  series  for  the  target  site.  According  to 
the  data  shown  in  the  table,  the  minimum  wind  speed  uncertainty 
is  obtained  with  the  use  of  WAsP.  However,  the  authors  consider 
[16]  that  each  of  the  different  wind  speed  forecast  methods  will 
have  its  pros  and  cons  at  a  particular  site  and  it  will  be  up  to  the 
planner  to  take  a  decision  as  to  which  method  is  the  most  suitable 
in  each  case.  They  point  out,  for  example,  that  WAsP  can  display  a 
high  degree  of  uncertainty  in  wind  speed  prediction  when  dealing 
with  a  site  of  complex  topography.  So,  MCP  methods,  which  do  not 
require  topographic  site  details,  would  appear  to  be  more  suitable 
for  sites  of  complex  topography.  They  also  make  clear  that  the 
uncertainty  of  MCP  methods  will  be  lower  if  the  duration  of  the 
wind  speed  measurement  campaign  at  the  target  site  is  extended. 
In  Refs.  [35-36],  the  authors  analyse  that  the  use  of  WAshwhen 
obstacles  are  found  near  the  anemometer  mast. 

MCP  methods  have  been  mainly  applied  in  estimation  of 
onshore  wind  resources  [37-39],  though  their  use  has  also  been 
proposed  for  offshore  wind  resource  estimation  [40-43]  and  the 
possibility  of  their  use  has  been  analysed  for  swell  estimation  [40], 
given  the  dependence  of  swell  on  wind  speed  [44],  It  can  be  seen 
that  MCP  methodology  has  spread  to  other  renewable  energy 
sources.  In  this  context,  studies  have  been  undertaken  with  the 


Wind  data 


Fig.  1.  Block  diagram  of  the  procedure  normally  employed  by  MCP  methods. 
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aim  of  examining  the  quality  and  consistency  of  some  MCP 
methods  in  the  estimation  of  long-term  solar  resources  (horizontal 
global  radiation)  at  a  target  site  [45]. 

Due  to  the  characteristics  of  the  variables  that  are  involved  in 
MCP  methods,  it  is  assumed  that  predictions  obtained  with  them 
are  subject  to  uncertainty  [15,46-53].  With  this  in  mind  and  given 
that  the  users  do  not  have  at  their  disposal  long-term  data  for  each 
site,  several  authors  have  proposed  the  use  of  statistical  models  to 
estimate  this  uncertainty  [46-48,52].  These  models  attempt  to 
estimate  MCP  prediction  uncertainty  based  on  the  relationship 
established  between  data  recorded  simultaneously  at  the  refer¬ 
ence  and  target  sites.  This  involves  assuming  that  the  variability  of 
the  relationship  between  the  data  from  the  reference  and  target 
sites,  as  seen  in  the  concurrent  data  set,  reflects  the  long-term 
variability  between  the  data  of  the  two  sites. 

3.3.  Aim  of  this  review 

A  review  is  undertaken  in  this  paper  for  a  wide  range  of  MCP 
methods  that  have  been  proposed  for  wind  energy  analysis,  some 
of  which  have  been  implemented  in  wind  energy  industry  soft¬ 
ware  [26-32]  and  analysed  in  academic  environments  [54-57], 
This  review  includes  the  initial  methods  proposed  in  the  1940  s 
which  generally  only  attempted  to  estimate  the  long-term  annual 
mean  wind  speed  from  a  single  reference  station,  and  extends  to 
methods  proposed  in  the  present  century  based  on  automatic 
learning  techniques  which  use  several  reference  stations  and 
which  are  not  as  yet  in  common  use  in  the  wind  industry. 

In  addition  to  describing  the  linear,  non-linear  and  probabilistic 
transfer  functions  used  by  the  different  algorithms,  the  hypotheses  on 
which  these  algorithms  are  based  and  the  data  format  with  which 
they  work  (time  series  or  frequency  distributions),  this  review  will  also 
examine  the  limitations  in  the  use  of  MCP  methods,  the  uncertainty 
associated  with  them  and  the  different  reference  data  sources  that 
have  been  proposed.  In  this  sense,  the  extensive  collection  of  MCP 
methods  which  have  been  brought  together  and  reviewed  in  this 
paper,  ranging  from  the  simplest  and  easiest-to-use  models  to  the 
most  complicated  computational  ones  which  require  specific  user 
experience,  comprises  an  extremely  useful  catalogue. 


2.  Determinants  of  MCP  methods 

It  is  important  to  note  that  MCP  methods  are  based  on  a  series 
of  hypotheses.  These  hypotheses  are  described  and  analysed  in  the 
following  sub-sections: 

2.1.  Appropriate  measurement  protocols 

MCP  methods  are  based  on  the  hypothesis  that  the  long-term  data 
of  the  reference  and  target  stations  and  the  short-term  data  of  the 
target  station  have  been  obtained  in  accordance  with  appropriate 
measurement  protocols  that  can  guarantee  their  accuracy  [5,7,9], 
Albers  and  Klug  [7]  report  that,  because  of  a  lack  of  experience,  many 
wind  speed  measurements  have  an  unacceptably  high  uncertainty  as 
proper  practices  were  not  observed  when  selecting  and  mounting 
anemometers,  when  choosing  the  measuring  site,  or  when  deciding 
upon  the  height  and  duration  of  the  measurements.  Before  using  wind 
data  series  in  an  MCP  analysis,  it  is  of  paramount  importance  that  a 
thorough  check  is  made  on  their  validity  [5], 

In  order  to  pass  the  consistency  test  that  this  hypothesis 
entails,  the  reference  stations  must  satisfy  a  series  of  requirements 
throughout  the  historical  (long-term)  reference  period  [11,58-59]: 


wind  farms  [60]  or  changes  to  the  vegetation  surrounding  the 
wind  masts  that  might  distort  the  relationship  between  the 
target  and  reference  site  wind  data. 

(b)  Neither  the  height  nor  location  of  the  wind  mast  can  have 
been  subject  to  modification  during  the  period. 

(c)  Probst  and  Cardenas  [61]  propose  that  the  heights  above 
ground  level  (agl)  at  which  the  data  are  collected  at  the 
reference  and  target  sites  should  be  similar.  In  a  typical  case 
in  which  the  reference  station  data  are  recorded  at  a  low 
height  (generally  10  m  agl)  and  the  target  site  station  data  at  a 
range  of  heights  between  40  and  60  m,  according  to  the 
authors,  it  has  been  shown  that  the  correlation  coefficient 
between  the  two  data  series  is  lower  than  that  obtained  when 
both  data  series  are  recorded  at  the  same  height  of  10  m  agl. 
They  state  that  this  is  because  there  can  be  substantial 
variation  between  daily  wind  speed  profiles  at  very  different 
heights.  The  authors  propose  downscaling  the  wind  speeds  at 
the  target  site  to  10  m  agl  before  calculating  the  parameters  of 
a  linear  regression  based  MCP  method.  Once  the  MCP  has  been 
applied  and  the  long-term  wind  speeds  estimated  at  the  target 
site,  these  can  then  be  rescaled  to  the  corresponding  height  of 
40  or  60  m.  The  authors  [61  ]  state  that  in  this  procedure  case- 
by-case  verification  should  be  made  that  the  errors  generated 
in  the  extrapolation  stage  do  not  eliminate  the  reduction  in 
the  prediction  error  obtained  as  a  result  of  the  downscaling. 
With  regard  to  this  point,  the  authors  of  this  review  would  like 
to  underline  this  recommendation  of  Probst  and  Cardenas 
concerning  case-by-case  verification,  as  in  some  cross¬ 
correlation  analyses  conducted  between  anemometer  stations 
installed  in  the  Canary  Islands  (Spain)  results  were  obtained 
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(a)  The  weather  station  data  must  not  have  been  affected  by 
factors  such  as  the  construction  of  buildings,  installation  of 


Fig.  2.  Daily  wind  speed  profiles,  at  different  heights  above  ground  level,  from  two 
anemometer  stations  installed  on  the  island  of  Gran  Canaria  (Spain).  Correlation 
coefficients  between  the  wind  speeds  recorded  in  2010  at  the  various  heights. 
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which  do  not  confirm  the  wisdom  of  applying  the  downscaling 
strategy  proposed  in  [61],  For  example,  it  can  be  observed  in 
Fig.  2  how  the  cross-correlations  made  between  hourly  mean 
wind  speeds  recorded  in  2010  at  a  target  site  (installations 
belonging  to  the  Canary  Islands  Technological  Institute)  and 
reference  site  (Gran  Canaria  airport),  some  13  km  apart,  dis¬ 
play  certain  discrepancies  with  the  results  indicated  in  [61], 
There  are  no  significant  differences  between  the  correlation 
coefficients  of  the  reference  station  data  measured  at  10  m  agl 
and  the  data  of  the  target  site  station  measured  at  different 
heights  (10,  20,  40  and  60  m  agl).  The  highest  correlation 
coefficient  between  the  two  stations  is  obtained  between  the 
data  recorded  at  10  m  agl  (reference)  and  20  m  agl  (target). 
However,  the  correlation  coefficients  obtained  between  the 
data  recorded  at  the  target  site  station  at  different  heights 
(10,  20,  40  and  60  m  agl)  reflected  what  was  expected.  That  is, 
as  also  observed  by  Jain  [2],  the  lowest  correlation  was  found 
between  the  measurements  at  60  m  and  10  m  agl  (r=  0.987)  as 
a  result  of  surface  roughness  of  the  ground.  In  addition, 
although  ideally  a  measure  of  atmospheric  stability  (stable, 
neutrally  stable  or  unstable)  could  be  used,  sufficient  informa¬ 
tion  may  not  be  available  to  allow  its  calculation  and  in  many 
situations  the  vertical  wind  profile  models  (logarithmic  law 
and  power  law)  usually  only  consider  neutral  stratification 
[2,9,16],  For  this  reason,  Oliver  and  Zarling  [621  propose  a 
procedure  to  replace  the  effect  of  atmospheric  stability,  which 
consists  of  calculating  the  correlation  according  to  the  hour  of 
the  day,  given  the  influence  of  daily  changes  in  atmospheric 
thermal  stratification  on  vertical  wind  profiles  [63], 

(d)  The  wind  data  must  been  recorded  with  essentially  the  same 
equipment. 


2.2.  Similar  wind  climate  at  the  reference  and  target  sites 


This  assumption  is  based  on  the  idea  that  the  reference  station 
data  are  representative  of  the  wind  climate  of  the  target  site. 
Basically,  this  is  usually  equated  in  the  literature  to  the  existence  of 
a  good  correlation  between  the  reference  station  and  target  site 
station.  However,  other  types  of  verification  are  also  commonly 
performed.  These  include  comparisons  between  daily  and  monthly 
wind  speed  profiles  and  comparisons  between  the  wind  roses 
recorded  at  the  reference  and  target  sites.  Appreciable  differences 
in  these  comparisons  or  low  coefficients  of  correlation  can  lead  to 
rejection  of  the  reference  station  for  its  use  in  MCP  methodologies 
[2,64-65], 

The  correlation  is  often  quantified  by  calculating  the  Pearson 
product-moment  coefficient  of  linear  correlation  (otherwise 
known  as  Pearson  s  correlation  coefficient),  r.  In  this  case,  Pear¬ 
son’s  correlation  coefficient  measures  the  strength  and  direction 
(rising  or  falling,  depending  on  whether  the  sign  is  positive  or 
negative,  respectively)  of  the  linear  association  between  the  wind 
speeds  recorded  at  the  reference  station  and  target  site  station 
during  the  concurrent  data  period  [2,5,9],  Its  value  is  given  by  the 
following  equation  [66-69]: 


ZL  i[(Vi)f-vf  ][(Vj)f-vf  ] 
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where  (v,))7  and  (v,)fT  are  the  observed  short-term  wind  speeds  at 
the  reference  and  target  sites,  respectively.  vfT  and  v^T  are  the 
respective  means. 

According  to  a  number  of  references  [2,5,9],  if  the  correlation  is 
weak  the  results  can  be  misleading  and,  in  such  a  case,  the  long¬ 
term  data  from  the  reference  station  should  be  discarded  or  used 
with  precaution. 


Clarification  should  be  made  at  this  point  that  the  use  of 
Pearson's  correlation  coefficient  r  to  measure  the  degree  of 
association  between  the  wind  speeds  of  the  target  and  reference 
site  can  be  a  valuable  tool.  However,  certain  precautions  need  to  be 
taken,  as  r  is  neither  robust  nor  resistant  [69].  It  is  not  robust 
because  strong  but  non-linear  relationships  between  wind  speeds 
at  the  two  sites  may  not  be  recognised,  and  it  is  not  resistant 
because  it  can  be  extremely  sensitive  to  one  or  a  few  pairs  of 
outliers.  In  other  words,  the  value  of  r  can  be  seriously  affected  by 
the  presence  of  a  few  odd  observations  which  are  found  far  from 
the  bulk  of  the  data,  Fig.  3.  These  ‘odd’  observations  can  generate  a 
high  asymmetry  in  the  distributions  of  the  two  variables,  and  this 
asymmetry  has  a  profound  effect  on  the  correlation  coefficient. 


Wind  speed  at  the  airport  (m/s) 


Fig.  3.  Effect  on  the  correlation  coefficient  of  the  data  recorded  during  2010  by  two 
anemometer  stations  installed  on  the  island  of  Gran  Canaria  (Spain)  at  10  m  agl 
(Airport)  and  60  m  agl  (ITC),  when  outliers  are  introduced  which  produce  asym¬ 
metry  in  both  variables. 


Wind  speed  at  the  airport  (m/s) 


Fig.  4.  Effect  on  the  correlation  coefficient  of  the  data  recorded  during  2010  at  two 
anemometer  stations  installed  on  the  island  of  Gran  Canaria  (Spain)  at  10  m  agl 
(Airport)  and  60  m  agl  (ITC),  when  outliers  are  introduced  which  produce  asym¬ 
metry  in  just  one  of  the  variables. 
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In  this  case,  the  impact  produced  on  r  by  the  ‘odd’  observations  is 
commonly  called  the  “King-Kong”  or  “Big  Apple”  effect  [67-68], 
This  effect  can  substantially  increase  the  value  of  r.  Sometimes  an 
outlier  can  produce  an  asymmetric  distribution  in  just  one  of  the 
variables,  lowering  the  value  of  r  [67],  Fig.  4.  Finally,  it  should  be 
noted  that  the  value  of  r  only  indicates  the  extent  to  which  the 
target  and  reference  wind  speeds  are  linearly  associated,  r  in  itself 
does  not  explain  any  relationship  between  the  wind  speeds  at 
these  sites,  at  least  not  in  any  physical  or  causative  sense4  [66,67], 

McKenzie  et  al.  [70]  argue  that  not  all  MCP  methodologies  have 
a  linear  nature  and  it  is  therefore  possible  that  ways  of  measuring 
the  correlation  other  than  Pearson's  may  provide  a  more  precise 
description  of  the  relationship  between  the  wind  data  of  the  two 
sites.  They  claim,  for  example  that  a  more  appropriate  method 
would  be  to  use  a  non-parametric  measure  of  association  such  as 
Spearman's  rank  correlation  coefficient,  rs,  [71],  This  coefficient 
can  be  determined  through  Eq.  (1),  replacing  the  observations  of 
the  wind  speeds  recorded  at  the  two  sites  with  the  ranks.  Whereas 
Pearson's  correlation  coefficient  is  a  measure  of  the  degree  of 
linear  association  that  exists  between  the  wind  speeds  recorded  at 
the  target  and  reference  sites,  Spearman's  rank  correlation  coeffi¬ 
cient  measures  the  trend.  In  a  sense,  rs  has  higher  significance  than 
r  because,  when  measuring  the  degree  of  monotonic  association 
between  the  wind  speeds,  rs  is  not  restricted  to  finding  a  linear 
association  between  them  [71], 

Several  studies  have  been  made  in  relation  to  the  influence  on 
the  correlation  coefficient  of  the  time  scale  of  the  wind  data  series 
and  the  spatial  distances  between  the  reference  and  target 
anemometer  stations  [33,34,72,73],  Nielsen  et  al.  [74]  state  that 
before  using  an  MCP  method  consideration  should  be  given  to  the 
possible  need  for  time  averaging  of  the  data.  If  short  time  averages 
(approximately  10  min)  are  used  but  the  stations  are  not  closely 
situated,  probably  no  correlation  will  be  found  between  the  wind 
speeds  as  a  result  of  variations  in  the  short  time  scale.  Ayotte  et  al. 
[72]  undertook  a  simple  time  and  spatial  analysis  of  wind  speed 
and  direction  data  obtained  from  a  number  of  weather  towers 
separated  by  distances  ranging  from  1  to  100  km.  For  this  purpose, 
they  developed  a  band-limited  correlation  coefficient  in  the 
frequency  domain,  performing  discrete  Fourier  transforms  [75] 
of  the  time  series.  Ayotte  et  al.  [72]  note  that  there  is  a  close 
relationship  between  the  distance  that  separates  the  two  sites 
where  the  wind  data  were  recorded  and  the  correlation  that  exists 
between  the  data.  This  in  turn  means  there  will  be  a  relationship 
between  that  distance  and  the  accuracy  that  is  expected  when 
predicting  the  averaged  wind  speed  and  direction  values  based  on 
these  recorded  data. 

Bowen  and  Mortensen  [34]  obtained  interesting  results  in  their 
study,  though  these  were  not  conclusive.  Bearing  in  mind  the 
significant  distance  between  most  of  the  sites  analysed,  they 
concluded  that  cross-correlation  coefficients  with  zero  time  lag, 
calculated  with  10-min  averaged  wind  speed  series,  might  not  be 
suitable,  deciding  that  a  longer  average  time  (specifically  1  h) 
might  be  more  appropriate  in  these  cases.  However,  they  only 
observed  a  small  improvement  in  the  cross-correlation  coeffi¬ 
cients,  Eq.  (2),  with  a  time  lag  k  equal  to  zero  for  1-hour  averaged 
wind  speeds,  compared  to  the  coefficients  obtained  with  10-min 
averaged  speeds.  The  authors  confess  that  they  did  not  look  into 
this  question  in  great  depth  and  did  not  resolve  it: 


r(k)  = 
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In  the  cross-correlation  equation,  c(k)  is  given  bythe  following 
equation: 

C(k)  =  1  Zn  =  1  [(Vttjf  -vf  ][(vtt+k)f -vf  ]  (3) 

where  ( vft  )f T  and  (vtt);!r  are,  respectively,  the  short-term  target  and 
reference  site  wind  speeds  at  time  tt,  sfT  and  s^J  are  their  standard 
deviations  and  v^T  and  vfT  are  the  mean  values,  k(  =  0, 1,  2,  3,...Kr) 
is  the  time  lag,  n  is  the  number  of  data  and  Kr  is  the  number  of  lags 
considered. 

Frith  [73]  calculated  the  cross-correlations  between  hourly 
mean  wind  speeds  of  seven  anemometer  stations  in  Scotland. 
According  to  the  author,  the  results  were  as  expected.  Good 
correlation  coefficients  were  observed  between  r=0.67  and 
r=0.84  with  0  or  1  h  time  lag.  The  correlation  coefficient  between 
the  westernmost  and  easternmost  sites  fell  to  0.52  with  a  3  h  time 
lag.  Friih  [73]  observed  that  the  range  of  time  lags  for  the  best 
correlation  between  two  sites  was  consistent  with  the  approxi¬ 
mately  south-westerly  prevailing  wind.  Lamberg  and  Mortensen 
[33],  understanding  that  for  proper  use  of  MCP  methods  a  good 
correlation  between  target  and  reference  wind  data  is  vital,  also 
investigated  this  question  in  a  study  of  15  possible  cross¬ 
correlations  between  10-min  averaged  wind  speeds  from  six  sites. 
As  all  the  correlation  functions  in  their  studies  decreased,  they 
concluded  that  it  was  not  necessary  to  take  into  consideration  any 
time  lag. 

No  definitive  and  corroborated  scientific  criterion  has  been 
published  in  renewable  energy  related  literature  concerning  the 
correlation  level  below  which  reference  station  data  should  not  be 
used  due  to  their  unreliability  [2,5].  However,  some  rules  of  thumb 
have  been  proposed  [2,5,19,58,65,76].  According  to  Ref.  [19],  the 
coefficient  of  determination  [75],  R2,  should  not  be  lower  than  70%. 
R2  is  defined  in  general  terms  as  the  ratio  between  the  variability 
explained  by  the  regression  and  the  total  variability,  as  seen  in  the 
following  equation: 


,2  £Ll[ivi)f-'f] 
zr= ,  [(v,)fr-vfT]2 


R2  measures  the  proportion  of  the  total  variation  about  the 
mean,  (v)fT.  explained  by  the  regression.  Often,  R2  is  expressed  as  a 
percentage  and  is  the  square  of  what  is  usually  called  the  multiple 
correlation  coefficient.  In  fact,  R  is  the  correlation  between  the 
measured  short-term  data,  (v,)fr,  and  the  short-term  data  as 
estimated  by  the  model,  (v,)fr. 

In  Ref.  [5],  it  is  reported  that,  as  a  general  rule,  when  monthly 
and  in  all  directions  the  square  of  Pearson's  correlation  coeffi¬ 
cient,5  r 2,  of  the  wind  speed  is  less  than  0.8  there  is  substantial 
uncertainty  as  to  whether  the  reference  station  long-term  data 
should  be  used  to  estimate  long-term  wind  conditions  at  the 
target  site. 

According  to  Bass  [76],  care  should  generally  be  taken  that  the 
coefficient  of  correlation  is  higher  than  0.7  and  according  to  Jain 
[2]  if  the  value  of  the  coefficient  of  correlation  is  0.9  or  higher  the 
correlation  is  considered  to  be  excellent.  A  guide  is  also  provided 
in  Ref.  [2]  for  the  acceptable  correlations  when  it  comes  to 
determining  whether  two  wind  speed  time  series  share  the  same 
wind  climate.  So,  for  example,  a  correlation  of  raw  data  between 
wind  speed  series  with  10-min  intervals  above  0.65  or  a  correla¬ 
tion  of  daily  average  data  of  those  series  above  0.75  is  considered 
acceptable.  In  the  study  undertaken  in  [65],  the  sites  which  were 
considered  clearly  unsuitable  for  an  MCP  method  were  discarded. 
So,  the  appropriateness  or  inappropriateness  of  using  an  MCP 


4  Despite  r  being  neither  robust  nor  resistant  it  has  been  widely  used,  probably 
because  its  form  lends  itself  well  to  mathematical  manipulation  and  because  it  is 
closely  related  to  the  linear  regression  used  in  MCP  methods. 


5  Only  in  the  case  of  a  simple  linear  regression  is  the  coefficient  R  between 
(vpf  and  (v,jf  exactly  the  same  as  the  correlation  coefficient  r  between  (v,)fr  and 
(v,)f.  In  other  cases,  the  relationship  R2  =  r2  does  not  hold. 
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method  is  based  on  the  level  of  correlation.  In  this  context,  overall 
correlation  coefficients6  of  0.5-0.6  are  considered  to  be  very  poor, 
and  those  with  ranges  of  0.6-0.7,  0.7-0.8,  0.8-0.9  and  0.9-1.0  are 
considered,  respectively,  poor,  moderate,  good  and  very  good. 
These  ratings  which  are  dependent  on  the  correlation  coefficient 
value  are  also  set  out  in  Ref.  [58], 

2.3.  Knowledge  of  the  pattern  of  seasonal  variations  in  the 
concurrent  data  period 

The  short-term  period  over  which  wind  data  is  available  for  both 
the  reference  and  target  sites  must  be  at  least  long  enough  to  allow 
information  to  be  extracted  with  regards  to  seasonal  wind  variations 
(speed  and  direction).  The  general  recommendation  is  for  this 
concurrent  data  period  to  be  at  least  one  year  long.  Taylor  et  al.  [49] 
undertook  a  study  to  analyse  the  long-term  prediction  variability  of 
MCP  methods  against  annual  data.  For  this  purpose,  they  used  sub¬ 
periods  of  1,  3,  6, 12, 18  and  24  months.  Their  results  suggest  that  one 
month  of  data  gives  rise  to  a  margin  of  uncertainty  (relative  standard 
deviation)  of  between  6.5%  and  about  12%  for  long-term  wind  speed 
estimations  at  the  analysed  stations.  For  the  three  and  six  month 
periods,  in  spite  of  the  existence  of  an  inverse  relationship  between 
the  uncertainty  and  the  data  measurement  period,  the  difference 
between  the  margins  of  uncertainty  is  still  quite  high  (approximately 
4%  for  the  6-month  sub-period).  However,  once  the  data  period  is 
extended  to  12  months  there  is  a  considerable  fall  in  uncertainty  and 
the  difference  between  the  margins  of  uncertainty  is  significantly 
smaller  (approximately  2%).  Few  changes  are  noted  in  uncertainties  in 
the  analysis  of  the  18-month  sub-period,  but  the  24-month  period 
sees  a  reappearance  of  the  trend  towards  lower  uncertainty.  The 
authors  conclude  that  the  results  obtained  from  their  study  provide 
evidence  that  there  is  a  significant  seasonal  influence  on  the  quality  of 
long-term  estimations.  Consequently,  they  underline  the  importance 
of  having  at  least  12  months’  (or  multiples  thereof)  worth  of 
measurements  for  the  target  site.  Oliver  and  Zarling  [77]  state  that 
long-term  wind  speed  predictions  made  with  less  than  one  year's 
worth  of  data  for  the  site  can  be  subject  to  large  errors.  The  clear 
seasonal  patterns  that  appeared  in  a  study  carried  out  by  these 
authors  using  14  station  pairs  (target-reference)  situated  in  the  “wind 
corridor”  from  Texas  to  Dakotas,  showed  the  importance  of  taking  into 
consideration  exactly  when  during  a  year  a  measurement  campaign  is 
initiated  when  making  a  prediction  of  wind  speed  for  data  sets  of  less 
than  one  year's  duration.  Often,  business  considerations  [4,15,77]  will 
prevent  a  year's  wait  for  data  records  and  will  require  predictions  to  be 
made  on  the  basis  of  less  than  one  year's  data  for  the  target  site.  In 
these  cases,  a  greater  understanding  is  required  for  the  effects  of 
seasonal  fluctuations  on  the  bias  of  the  wind  speed  prediction  so  that 
a  more  informed  analysis  can  be  made  of  the  long-term  wind  speed 
prediction  [77]. 

2.4.  Climate  stability 

MCP  methods  work  on  the  assumption  that  the  data  sets  are 
statistically  stationary.  That  is,  the  effects  of  climate  change  are  ignored 
and  it  is  assumed  that  wind  behaviour  in  the  future,  during  the  life 
cycle  of  the  energy  project,  will  be  similar  to  that  of  the  past.  Wind 
and  climate  variabilities  are  closely  linked.  In  scientific  circles,  it  is 
generally  reckoned  that  as  a  result  of  global  warming  the  difference  in 
temperature  between  the  poles  and  the  equator  will  be  reduced  and 
mid-latitude  winds  will  also  undergo  some  modifications  [63,78]. 
Nonetheless,  it  should  be  pointed  out  that  in  the  specialist  literature 
consulted  by  the  authors  of  this  review  there  exists  a  certain  amount 


6  These  come  from  the  correlation  calculated  for  each  wind  direction  sector 
weighted  with  the  frequency  of  this  distribution  sector  at  the  reference  site. 


of  controversy  as  to  the  variation  experienced  in  the  last  few  decades 
by  wind  resources  throughout  the  world  as  a  result  of  climate  change 
or  alterations  to  surface  roughness  of  the  ground.  A  number  of 
researchers  have  reported  that  surface  winds  have  fallen  in  China, 
the  Netherlands,  the  Czech  Republic,  the  United  States  and  Australia  in 
the  last  few  decades  [79-80].  Vautard  et  al.  [81  ]  analysed  the  extent 
and  possible  cause  of  surface  wind  speed  changes  in  the  northern 
mid-latitudes  between  1979  and  2008  using  data  from  822  surface 
weather  stations.  They  reported  that  surface  wind  speeds  declined 
between  5%  and  15%  in  almost  all  continental  areas  in  the  northern 
mid-latitudes,  and  that  strong  winds  slowed  faster  than  weak  winds. 
The  mesoscale  model  simulations  that  they  used  suggest  that  an 
increase  in  surface  roughness  could  explain  reduction  of  the  wind 
between  25%  and  60%.  However,  other  authors  [9,82]  claim  that  there 
is  thus  far  little  reliable  evidence  of  a  significant  rise  or  fall  in  wind 
resources  in  the  world  over  the  past  few  decades  as  a  result  of  climate 
change.  Several  studies  have  been  carried  out  with  the  aim  of 
analysing  potential  changes  in  wind  speed  in  the  future  as  a  result 
of  climate  change  [77-87].  Breslow  and  Sailor  [83]  investigated 
potential  impacts  throughout  the  continental  US.  For  this  purpose, 
they  used  two  general  circulation  models  which  were,  in  general, 
consistent  in  the  prediction  that  the  US  will  see  a  reduction  compared 
with  the  1948-1975  period  in  the  mean  annual  wind  speed  of 
between  1.0%  and  3.2%  in  the  next  50  years  and  between  1.4%  and 
4.5%  in  the  next  100  years,  as  a  consequence  of  the  response  of  the 
atmosphere  to  climate  change  induced  by  the  increase  in  C02 
concentration.  On  a  regional  scale,  the  two  models  showed  some 
similarities  in  the  first  years  of  simulations  (2050),  but  diverged 
significantly  in  their  predictions  for  2100.  So,  there  is  clearly  a  lot  of 
uncertainty  as  to  how  wind  fields  will  change  in  the  future  [84],  Sailor 
et  al.  [85]  used  statistically  ‘downscaled’  outputs  of  four  general 
circulation  models  (GCMs)  to  investigate  scenarios  of  the  impact  of 
climate  change  on  wind  energy  generation  in  five  states  in  the 
northwest  of  the  United  States.  The  results  suggest  that  summer  wind 
speeds  in  the  northwest  may  fall  by  5-10%,  whereas  winter  wind 
speeds  will  suffer  only  a  moderate  decline  or  may  even  rise  slightly. 
Watson  and  Kritharas  [86]  used  historical  variability  (53  years)  of  wind 
speed  in  the  United  Kingdom  in  order  to  evaluate  the  uncertainties  in 
long-term  assessment  of  the  wind  resource.  Their  analysis  of  long¬ 
term  variability  was  carried  out  through  the  construction  of  a  wind 
index  based  on  53  years  of  wind  speed  data  from  seven  surface 
stations  and  27  years  of  wind  speed  data  from  57  stations  in  the  UK. 
This  index  was  also  used  to  construct  six  regional  indices.  The  authors 
concluded  that  the  long-term  data  of  53  years  do  not  appear  to 
suggest  any  perceptible  trend  towards  an  increase  or  decrease  in  wind 
speed  with  time.  The  trends  observed  in  the  regional  indices  were  not 
incompatible  with  the  predictions  of  some  regional  climate  models, 
but  long-term  trends  for  the  prediction  of  future  wind  speeds  remain 
very  uncertain  and  there  exists  significant  interannual  variation  in  the 
observed  data. 

Pryor  and  Barthelmie  [87]  report  that  current  state-of-the-art  does 
not  suggest  the  existence  of  detectable  changes  to  the  wind  resource 
or  other  external  conditions  that  could  put  at  risk  continued  wind 
energy  exploitation  in  the  north  of  Europe,  though  more  research  is 
required  to  ensure  a  greater  degree  of  confidence  in  such  projections. 
It  can  be  concluded  from  the  results  of  the  various  studies  that  have 
been  carried  out  that,  as  indicated  by  Brower  [9],  the  effect  of  climate 
change  on  wind  speed  within  the  time  horizon  of  wind  project 
investments  (i.e.  up  to  25  years)  will  probably  be  low. 


3.  Proposed  reference  data  sources 

Given  the  existence  of  sites  where  the  restrictions  listed  in 
Section  2  limit  the  surface  weather  station  reference  data  and  the 
length  of  the  reference  periods  that  can  be  used,  other  data 
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sources  have  been  analysed  [59,88,89]  and  studies  undertaken  to 
find  out  whether  the  use  of  reanalysis  data  might  constitute  an 
alternative  approach  [9,90-92],  One  alternative  source  for  refer¬ 
ence  data  are  data  from  rawinsonde  measurements  [59,88], 
However,  despite  the  consistency  and  reliability  of  these  sources, 
there  are  considerably  fewer  of  this  type  of  station  compared  to 
surface  stations  and,  generally,  they  collect  data  only  on  a  12 
hourly  basis  at  heights  above  the  planetary  boundary  layer. 
Consequently,  rawinsonde  data  do  not  correlate  as  well  as  surface 
station  data  with  measurements  taken  at  wind  project  sites  [89], 

Reanalysis  is  a  method  used  to  construct  a  record  of  climate 
data  in  a  three-dimensional  global  or  regional  grid  which  com¬ 
bines  past  observations  from  different  observation  and  measure¬ 
ment  systems  (surface  weather  stations,  satellites,  rawinsonde, 
etc.)  with  a  numerical  weather  prediction  (NWP)  model  that 
creates  a  physically  consistent  picture  of  how  the  earth's  climate 
has  evolved  over  time. 

The  most  well-known  reanalysis  data  are  generated  by  the 
National  Centres  for  Environmental  Prediction  (NCEP)  [93]  and  the 
National  Centre  for  Atmospheric  Research  (NCAR)  [93]  in  the  US 
and  the  European  Centre  for  Medium-Range  Weather  Forecasts 
(ECMWF)  [94]  in  Europe. 

According  to  Brower  [90],  there  are  two  main  reasons  that 
suggest  that  reanalysis  data  might  be  attractive  for  use  in  MCP 
methods.  First  is  their  ease-of-use,  since  the  data  are  open  for 
public  use  in  a  global  network  and  there  is  no  need  to  search  for 
suitable  weather  stations  or  compare  several  different  stations  in 
order  to  find  the  best  one.  The  second  reason  is  that  the  data 
record  is  longer  than  that  available  for  most  weather  stations. 

However,  Brower  [90],  as  the  main  conclusion  of  studies 
undertaken  with  reanalysis  data  and  direct  observations,  reports 
that  NCAR/NCEP  reanalysis  data  are  not  sufficiently  reliable  for  use 
in  MCP  methods.  Though  the  data  reference  periods  are  relatively 
short  (10  years  or  less),  the  false  fluctuations  and  trends  that  can 
occur  in  reanalysis  data  often  result  in  bigger  errors  than  those 
obtained  with  the  conventional  method.  So,  it  is  concluded  [59,90] 
that  although  these  errors  do  not  result  in  all  cases,  bearing  in 
mind  the  importance  of  MCP  in  energy  production  estimation 
prior  to  the  construction  of  a  wind  farm,  NCAR/NCEP  reanalysis 
data  should  not  be  used  for  this  purpose. 

Lileo  and  Petrik  [91]  investigated  the  applicability  of  wind  data 
from  three  different  reanalysis  data  sets  as  long-term  references  in 
MCP  analyses.  In  addition  to  NCEP/NCAR  data,  which  have  been 
commonly  used  in  wind  resource  analyses  over  the  past  decade, 
they  also  used  in  their  study  MERRA  (Modern  Era  Retrospective- 
analysis  for  Research  and  Applications)  [95]  and  NCEP/CFSR 


(National  Centres  for  Environmental  Prediction  /Climate  Forecast 
System  Reanalysis)  data  [96],  In  their  study  they  only  analysed  grid 
data  covering  the  territory  of  Sweden. 

The  results  obtained  by  the  authors  [91  ]  suggest  that  the  use  of 
MERRA  and  NCEP/CFSR  reanalysis  data  sets  represents  a  relevant 
improvement  in  the  accuracy  of  the  MCP  results.  The  higher 
spatial  and  temporal  resolutions  of  the  reanalysis  data  sets  in 
comparison  with  the  NCEP/NCAR  data  allow  better  representation 
of  the  local  wind  climate,  as  is  demonstrated,  according  to  the 
authors,  by  the  high  levels  of  correlation  with  the  data  records 
obtained  with  the  anemometers  installed  on  local  masts. 

Pinto  et  al.  [92]  compared  NCEP/NCAR  reanalysis  data  with 
local  measurements  from  20  wind  measuring  stations  spread  out 
over  Portuguese  territory.  Among  the  different  conclusions  made 
in  the  study,  they  report  that  the  6  h  data  from  the  reanalysis 
project  displayed  a  poor  correlation  with  the  measurements  of  the 
local  stations  as  a  result  of  the  high  variability  in  both  data  series. 
However,  they  indicate  that  the  correlation  tended  to  improves 
when  the  data  were  compared  on  a  monthly  and  annual  basis. 

Taylor  et  al.  [89]  presented  windTrends,  a  database  of  meteor¬ 
ological  conditions  covering  the  years  1997-2009.  WindTrends 
offers  a  snapshot  of  the  climate  each  hour  at  different  heights  agl 
in  a  regular-sized  grid  with  20-km  resolution  (around  100,000 
data  records  are  available  for  each  height  and  20-km  point).  The 
authors  provide  a  summary  of  the  results  they  obtained  in  a 
thorough  evaluation  of  the  suitability  of  the  data  for  use  in  MCP 
methods  and  other  applications.  Though  conceptually  similar  to 
reanalysis,  the  process  differs  in  two  fundamental  aspects:  grid 
resolution  is  finer  and  the  observational  data  source  -  rawinsonde 
only  -  remains  the  same  throughout  the  simulation  period. 
According  to  the  authors,  the  validation  study  they  presented 
shows  that  windTrends  is  generally  superior  to  global  reanalysis 
[97]  and  comparable  with  ASOS  (Automated  Surface  Observing 
System)  stations  for  the  purpose  of  use  in  MCP  methods. 


4.  Overview  of  proposed  measure-correlate-predict  models 

4.3.  Considerations  about  wind  speeds  used 

It  should  be  noted  that  most  MCP  methods,  for  the  purpose  of 
improving  predictability,  do  not  use  the  full  range  of  available 
wind  data  [2,5,33,65,98-99].  Filtering  of  the  wind  speed  data  is 
usually  proposed  along  with  rejection  of  data  that  fall  below  a 
certain  threshold  value  [2,5,33,65,98]  or  different  treatments  of 
values  below  this  threshold  [98,100],  According  to  some  authors 


N 


Fig.  5.  Wind  direction  sectors  of  equal  size,  commonly  used  in  MCP  methods. 
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Fig.  6.  Results  of  applying  two  methods  of  determining  the  number  and  size  of  the  wind  direction  sectors  which  avoid  the  existence  of  sectors  with  an  insufficient  number  of 
data  to  the  hourly  mean  data  recorded  at  10  m  agl  during  2010  at  two  anemometer  stations  (Airport  and  ITC)  installed  on  the  island  of  Gran  Canaria  (Spain),  (a)  Method 
which  aims  for  each  sector  to  have  at  least  1/24  of  the  total  number  of  data,  (b)  Method  which  aims  to  obtain  the  maximum  overall  coefficient  correlation. 


[2,5,33],  wind  speeds  below  3  m/s  are  not  important  from  an 
energy  point  of  view  and  can  skew  the  functional  relationship. 
However,  in  the  analysis  undertaken  in  [65]  calms  (wind  speeds 
below  0.1  m/s)  constituted  this  threshold  value. 

4.2.  Considerations  about  the  influence  of  wind  direction 

Generally,  in  the  most  widely  used  MCP  methods  in  the  wind 
industry  or  proposed  in  the  scientific  literature,  the  relationships 
established  between  the  target  and  reference  stations  are  not 
constructed  using  the  whole  mass  of  data  together.  Instead,  to  take 
into  consideration  the  effects  of  wind  direction,  the  data  are 
binned  or  grouped  in  sectors  defined  by  the  reference  site  wind 
direction  and  a  relationship  is  then  constructed  for  each  different 
sector.  The  wind  directions  recorded  at  the  reference  and  target 
sites  are  compared  to  determine  whether  there  exist  local  char¬ 
acteristics  which  might  have  an  influence  on  the  directional 
results  [5],  If  the  differences  between  the  wind  direction  records 
at  the  target  and  reference  sites  are  substantial  various  approaches 
may  be  taken  [61].  Woods  and  Watson  [101],  Velazquez  et  al.  [102] 
and  Carta  et  al.  [103]  have  also  proposed  other  approaches  which 
are  described  in  Sections  4.4.2  and  4.5. 

The  direction  sectors  are  usually  of  equal  size,  with  12  sectors 
of  30°  commonly  being  employed  [5,98,104-106],  Fig.  5.  However, 
care  should  be  taken  when  defining  the  number  of  direction 
sectors,  as  it  may  happen  that  there  are  sectors  for  which  an 
insufficient  number  of  data  are  available  to  provide  a  reliable  fit.  To 
reduce  the  effect  of  this  problem,  Riedel  et  al.  [107]  and  King  and 
Hurley  [98]  propose  that  the  direction  sectors  should  be  deter¬ 
mined  dynamically,  in  such  a  way  that  each  sector  contains  an 
equal  number  of  data  or  an  equal  fraction  of  the  total  data.  More 
specifically,  Riedel  et  al.  [107],  in  studies  carried  out  with  hourly 
and  10-min  mean  wind  speeds,  report  that  they  obtained  good 
results  when  each  sector  contained  at  least  1/24  of  the  total  data. 


A  method  called  “RES-OptiSector”  is  described  in  a  final  report 
[104]  which  describes  the  work  undertaken  in  a  research  project 
entitled:  “An  improved  measure-correlate  predict  algorithm  for 
the  prediction  of  the  long  term  wind  climate  in  regions  of  complex 
environment”,  funded  in  part  by  the  European  Commission  within 
the  framework  of  the  Non-Nuclear  Energy  Programme  JOULE  III 
(Programme  Acronym:  FP4-NNE-JOULE  C)  [108]  (contract  number 
JOR3-CT98-0295).  This  method  automatically  identifies  the  opti¬ 
mum  number  of  direction  sectors  and  their  location  with  the  aim 
of  maximising  the  overall7  Pearson's  correlation  coefficient,  r,8 
between  the  concurrent  wind  speed  data  of  the  reference  and 
target  sites.  For  this  purpose,  the  researchers  tested  various 
optimisation  algorithms  and  the  most  robust  was  found  to  be 
one  based  on  a  Monte  Carlo  simulation.  The  algorithm  starts  by 
calculating  the  value  of  r  for  just  one  sector.  It  then  determines  and 
stores  the  optimum  value  of  r  of  the  set  of  tests  generated  with 
two  sectors,  repeating  this  process  by  adding  one  sector  until  a 
total  of  12  has  been  reached,  Fig.  6.  In  [104],  a  method  called  “RES 
Fourier"  is  described  which  attempts  to  avoid  the  problem  of  a 
wind  direction  sector  containing  just  a  few  data  points.  For  this, 
the  use  was  proposed,  as  an  alternative  to  the  classical  linear 
regression,/]  ]  (with  slope  /i  and  offset  a),  of  the  function  shown  in 
the  following  equation: 


C Vjft  =  <J  a0  +  2  cik  sin  [(0,)/  1  +  bk  cos  [(0,)?  ]  f/[(W  >  (m  ]  (5) 


The  number  of  considered  equal  direction  sectors,  (0,)fr,  was 
1=36  and  the  number  of  independent  parameters  [2M(afc,  bk)  + 
l(a0)  +  2 (a,/?)]  used  was  19.  These  parameters  were  determined  by 


7  These  come  from  the  correlation  calculated  for  each  wind  direction  sector 
weighted  with  the  frequency  of  this  distribution  sector  at  the  reference  site. 

8  Eq.  (1)  is  used,  but  the  variables  refer  to  each  direction  sector. 
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minimising  the  chi-square  statistic  with  the  simplex  method  [75], 
As  reported  in  [104],  the  number  of  independent  parameters  was 
obtained  after  reaching  a  reasonable  compromise  between  accu¬ 
racy  and  computational  speed. 

Oliver  and  Zarling  [62]  propose  the  binning  or  grouping  of  data 
by  time  of  day  as  a  way  of  considering  the  effect  of  atmospheric 
stability.  These  authors  [62]  analysed  19  pairs  of  target- reference 
sites  with  at  least  two  years'  worth  of  concurrent  data.  They 
grouped  the  data  into  12  divisions  of  the  day  (each  division 
comprising  2  h)  and  compared  the  results  with  the  traditional 
approach  of  grouping  into  12  wind  direction  sectors  and  with 
combinations  of  both  approaches. 

They  concluded  that,  in  most  cases,  choosing  a  combination  of 
six  direction  sectors  (60°  each  sector)  and  two  time  groups  (12  h 
each  group),  instead  of  the  traditional  approach  of  12  direction 
sectors  (30  each  sector),  gave  a  better  prediction,  it  was  postu¬ 
lated  that  by  maintaining  the  traditional  12  direction  groups  and 
adding  2  or  3  time-based  groups  more  the  predictions  will  be 
improved  while  preserving  sufficient  data  points  in  each  group.  It 
was  also  considered  that  some  further  improvement  may  be 
achieved  by  varying  the  time  group  widths  depending  on  the 
month  of  the  year. 

4.3.  Estimation  methods  of  the  long-term  mean  wind  speed 

The  predictions  methods  of  long-term  wind  characteristics 
which  were  initially  proposed  between  the  1940s  and  1980s 
[10,64,109-121]  generally  try  to  estimate  the  mean  annual  wind 
speed  using  just  one  single  reference  station.  In  1948,  Putnam 
[109]  proposed  estimating  the  long-term  mean  wind  speed,  vfT,  at 
the  target  site  station  based  on  the  knowledge  of  its  short-term 
mean  wind  speed,  vf,  using  a  procedure  known  as  climatological 
reduction  by  the  method  of  ratios  [10,103,109,113,115,116],  if  the 
site  in  question  happens  to  be  in  the  vicinity  of  a  long-established 
meteorological  station.  The  proposal  of  Putnam  [109]  was  to  use 
the  following  equation: 

vf  =  [vf7'/vfr]vj:r  (6) 

In  Eq.  (6),  vJ:T  is  the  long-term  mean  at  the  reference  site  and 
v^rits  short-term  mean  (for  the  period  coinciding  with  that  of  vy). 

Conrad  and  Poliak  [115]  state  that  the  requirements  that  need 
to  be  met  for  this  method  to  be  applicable  are  that  the  deviations 
from  the  climatic  mean  must  be  quasi-constant  and  relatively 
homogenous.  The  ‘quasi-constant’  requirement  is  mathematically 
equivalent  to  that  of  a  high  spatial  cross-correlation  between  the 
deviations  of  the  climatic  means  at  the  target  and  reference  sites. 

Corotis  [111]  proposed  a  method  subsequently  referenced  in 
various  studies  [10,103,110,112,114-116]  which  explicitly  includes 
long-term  spatial  cross-correlation,  rLT,  between  both  sites  and  the 
long-term  standard  deviations  at  the  target  site,  sj-T,  and  reference 
site,  sj:T: 

vf  =  vf  +  r^[vf-vf  ][sf /s^]  (7) 

As  the  long-term  cross-correlation  and  long-term  standard 
deviation  are  unknown  for  the  target  site,  Daniels  and  Schroeder 
[112]  carried  out  a  series  of  hypothesis  tests  to  estimate  them.  For 
the  hypothesis  that  there  is  little  variation  in  the  correlation  over 
time,  they  proposed  use  of  the  short-term  correlation  instead  of 
the  long-term  correlation.  The  long-term  variance  at  the  target  site 
is  estimated  by  the  following  equation: 

(s[T)2  =  (sf)2  +  (s^-ST)2  (8) 

The  final  term  of  Eq.  (8)  is  the  long  term  variance  of  reference 
site  mean  wind  speeds  averaged  over  periods  as  long  as  the  short¬ 
term  survey  period,  ST.  This  expression  assumes  that  this  variance 
is  the  same  at  the  target  and  reference  sites.  In  addition, 


it  considers  that  the  variance  of  the  mean  hourly  wind  speeds  at 
the  target  site  during  the  short-term  period  is  representative  of 
the  long-term  conditions. 

Justus  et  al.  [10]  examined  the  procedures  of  Putnam  [109]  and 
Corotis  [111]  and  published  the  accuracy  of  the  results  they 
obtained  in  climatological  fits  performed  with  wind  data  from 
various  anemometer  stations  at  different  locations  in  the  US.  Only 
in  two  of  the  six  pairs  of  cases  analysed  were  the  errors  between 
the  long-term  mean  wind  speeds  estimated  with  the  methods  of 
Putnam  and  Corotis  and  the  observed  wind  speeds  lower  than  the 
errors  obtained  between  the  observed  long-term  mean  wind 
speeds  and  the  short-term  annual  mean  wind  speed  measured 
at  the  target  sites. 

Though  both  the  method  of  ratios  [109]  and  the  method  of 
Corotis  [111]  were  initially  used  without  taking  into  consideration 
wind  direction,  proposed  variations  do  include  consideration  of 
the  influence  of  wind  direction.  Methods  have  been  proposed 
which  differ  from  that  of  Putnam  [109]  in  which  different  ratios 
are  used,  namely  one  ratio  for  each  of  the  N  sectors  of  8°  defined 
by  the  wind  direction  sector  (8k)r  of  the  reference  site.  That  is, 
Eqs.  (6)  and  (7)  are  not  applied  to  the  whole  mass  of  data  together. 
Instead,  in  order  to  take  into  consideration  the  effects  of  wind 
direction,  the  wind  data  are  binned  or  grouped  in  sectors  defined 
by  the  wind  direction  at  the  reference  site  and  these  equations  are 
then  used  for  each  of  the  different  sectors. 

Harstveit  [118]  proposes  using  the  so-called  KH  method,  in 
which  a  ratio  of  mean  wind  speeds  is  used  for  each  wind  direction 
sector.  However,  this  procedure  belongs  to  the  small  group  of 
methods  which  do  not  follow  the  typical  structure  shown  in  Fig.  1, 
as  each  ratio  determines  the  relationship  between  the  short-term 
and  long-term  wind  speeds  at  the  reference  station.  The  KH 
method,  proposed  by  Harstveit  [118]  to  predict  the  long-term 
wind  speed  in  a  particular  wind  direction  sector, (8j)t,  of  the  target 
site  weights  the  contributions  of  each  of  the  wind  direction 
sectors,  (8k)r,  of  the  reference  site: 


,T  12 

[^](flj-)t=  I 

k  =  1 


tfWl 

pFWJ 


ipSTH8j)t\(8k)r] 

\  pLTm)t] 


(9) 


The  probability,  pLT[],  that  the  wind  blows  long-term  in  a 
direction  sector  j  of  the  target  site  is  estimated  making  use  of 
the  probability  that  the  wind  blows  long-term  in  direction  k  of  the 
reference  site  and  making  use  of  the  observed  conditional  prob¬ 
ability,  pST[- 1 -],  at  the  target  site  during  the  short-term  measure¬ 
ment  period: 


pLTW)c]  =  z  {pLTm)r]}{^Tm)tm)r]) 

k=  1 


(10) 


The  Tallhaug  and  Nygaard  method  [117-119]  is  to  a  certain 
extent  a  variation  of  the  method  of  Corotis  [111].  Tallhaug's 
method  estimates  the  long-term  wind  speed  at  the  target  site 
for  each  given  wind  direction  sector  k  of  the  reference  station 
through  the  following  equation: 


Pfw  +  {P'rT](et)r-[^T](eII)r Hl^W /[sf]Wr ) 

(11) 


The  long-term  wind  speed  in  each  wind  direction  sector  j  of 
the  target  site  is  estimated  by  the  Tallhaug  and  Nygaard  method 
[117-118]  using  the  following  equation: 


Z  (Vf  )|(0fc)r{PST  [(0j)r|(0fc)r]  } 

k  =  1 


PLTl(dk)r] 
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The  long-term  directional  probability  at  the  target  site  is 
estimated  using  Eq.  (10). 
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Tallhaug  and  Nygaard  [117]  do  not  recommend  their  model  for 
mean  wind  speed  estimation  by  wind  direction  sector  unless  the 
correlation  is  very  high. 

In  1982,  Barchet  [120],  in  a  report  for  the  Pacific  Northwest 
Laboratory,  described  the  use  of  weather  pattern  typing  for 
characterising  the  wind  regime  of  the  Great  Plains.  The  nine 
weather  patterns  that  the  author  defined  were  based  on  the  main 
features  of  fronts  and  air  masses  associated  with  a  Polar  Front 
Model  of  a  synoptic-scale  cyclonic  storm  system.  In  a  later  report, 
Barchet  and  David  [113]  characterised  the  wind  regime  of  each 
weather  pattern  type  (WPT)  so  that  the  long-term  mean  wind 
speed  could  be  reconstructed  from  the  mean  wind  speed  asso¬ 
ciated  with  each  of  the  nine  WPTs  and  their  frequency  of 
occurrence.  Barchet  and  David  [113]  used  a  reference  station  to 
help  to  calibrate  the  mean  wind  speed  for  a  WPT  and  proposed  the 
following  equation  to  estimate  the  long-term  mean  wind  speed  at 
the  target  site.  This  extrapolation  from  short  records  is  known  as 
the  “adjusted  WPT”  technique: 


3  =  1  (K  )j 


(13) 


in  Eq.  (13),  (FO/)t  is  the  frequency  of  occurrence  of  the  WPTj  at 
the  target  site  [120-121],  (vfT)j  and  (vf T)j  are  the  short-term  mean 
observed  wind  speeds  for  the  WPT  j  at  the  target  and  reference 
sites,  respectively,  (vj T)j  is  the  long-term  mean  wind  speed  for  the 
WPT  j  at  the  reference  site.  According  to  Barchet  and  David  [113], 
a  notable  drawback  of  WPT  techniques  is  that  a  quite  sophisticated 
analysis  needs  to  be  performed  of  the  meteorology  of  the  area  for 
them  to  be  of  use. 
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Fig.  7.  Results  of  applying  the  single  sector  ratio  method  and  the  WF  to  the  hourly 
mean  data  recorded  at  10  m  agl  during  2010  at  two  anemometer  stations  (Airport 
and  1TC)  installed  on  the  island  of  Gran  Canaria  (Spain). 


4.4.  Estimation  methods  of  the  long-term  wind  characteristics  with 
the  support  of  a  single  reference  station 


instead  of  to  the  reference  site's  long-term  mean  wind  speed, 
Eq.  (14).  This  ratio  is  equal  to  the  slope  of  a  line  which  passes 
through  the  origin  and  centroid  of  the  data,  Fig.  6: 


These  MCP  methods  were  published  fundamentally  from  the 
1990s  onwards.  The  most  commonly  employed  methods  use 
a  single  reference  station  and  algorithms  which  are  based  on  linear 
functions  or  models9  [2,4,9,13,17,19,21,61,65,74,76,98-101,103-106, 
122-129]  to  establish  the  relationship  between  hourly  (or  a  lower 
time  interval)  wind  characteristics  of  the  target  site  and  the  site 
chosen  as  the  reference.  Some  of  these  methods  can  be  used  to 
estimate  both  wind  direction  and  wind  speed  characteristics.  The 
format  of  the  data  normally  involves  time  series  of  frequency  tables. 
The  linear  relationships  between  wind  speeds  proposed  in  renewable 
energy  related  literature  include  wind  speed  or  frequency  ratios 
[65,98,100,104,129],  linear  regression  (with  first-order  linear  mod¬ 
els10 *)  [2,5,9,13,17,19,61 ,65,74,76,99,101 ,103-106,123-125,127],  the 
method  of  bins  proposed  by  Beltran  et  al.  [99],  the  Vertical  Slice 
method  proposed  by  LeBlanc  et  al.  [128],  the  SpeedSort,  DynaSort  and 
Scatter  methods  proposed  by  King  and  Hurley  [98]  and  the  variance 
method  proposed  by  Rogers  et  al.  [106]  and  subsequently  referenced 
in  various  studies  [4,21,61,125-128],  However,  higher  than  first-order 
linear  models  have  also  been  proposed  as  well  as  models  which 
establish  non-linear  relationships  [17,24,28,57,60,63,78,100,107- 
109,127,132-137]  or  probabilistic  relationships  [2,9,61,76,105,125- 
127,129-131,133,136-143], 

4.4.3.  Methods  of  ratios 

The  ratio  of  means  with  a  single  sector  method  [65]  differs 
from  the  method  proposed  by  Putnam  [109]  in  that  the  ratio  is 
applied  to  the  reference  site  time  series  of  long-term  wind  speeds 


(vF  = 


(vf) 


(vf) 


(W 


(14) 


The  ratio  of  means  with  multiple  sectors  method  [65]  differs 
from  the  single  sector  version  in  that  it  tries  to  take  into  account 
the  influence  of  the  wind  direction  and  for  that  purpose  uses  a 
ratio  for  each  of  the  N  sectors  of  8°  that  can  be  defined  in  relation 
to  the  direction  sector,  (6yr,  of  the  reference  site,  Eq.  (15).  A  total  of 
12  ratios  are  suggested  in  Ref.  [104].  That  is,  a  ratio  for  each  of  12 
direction  sectors  of  30°  (  345°-15°,  15°-45°,  45  -75°,  75°-105°, 
105°-135°,135°-165°,  165°-195°,  195°-225°,  225°-255  ,  255°  -285°, 
285°-315°,  315°-345°). 
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The  EWEA  method  uses  a  ratio  of  means  with  multiple  sectors 
and  in  its  algorithm  [98]  the  data  of  both  sites  with  wind  speeds 
below  a  certain  cut-off  speed  are  eliminated  and  the  remaining 
data  are  distributed  in  12  direction  sectors  defined  by  the 
reference  site  wind  direction.  A  ratio  of  means  is  then  defined 
for  each  sector  and  Eq.  (15)  is  used  to  estimate  the  long-term  wind 
speeds  at  the  target  site.11  It  is  assumed  that  the  long-term  wind 
directions  for  the  target  site  coincide  with  the  long-term  direc¬ 
tions  recorded  at  the  reference  site.  This  last  assumption,  accord¬ 
ing  to  King  and  Hurley  [98],  has  led  to  poor  sectorial  accuracy  in 
the  studies.  The  authors  point  out  they  often  observed  that  when 
the  target  site  was  windier  than  the  reference  site,  the  ratio 
between  the  target  and  reference  sites'  wind  speeds  fell  as  the 


9  In  this  paper,  when  we  say  a  model  is  linear  or  non-linear  we  are  referring  to 
linearity  or  non-linearity  in  the  parameters  [66,71], 

10  The  value  of  the  highest  power  of  the  predictor  variable  in  the  model  is 

known  as  the  order  of  the  model  [66], 


11  If  there  are  no  data  for  a  particular  sector,  the  ratio  is  established  at  a 
predetermined  value  based  on  data  obtained  from  all  the  sectors  (that  is,  without 
taking  into  consideration  the  wind  direction). 
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Fig.  S.  Fit  of  continuous  probability  distributions  to  the  estimated  wind  speed  and  direction  histograms  of  the  anemometer  station  located  in  Pozo  lzquierdo  (ITC,  Gran 
Canaria-Spain),  when  using  the  WF  method  and  long-term  wind  data  (1999-2010)  recorded  at  Gran  Canaria  airport. 


wind  speed  increased.  In  such  cases,  the  best  relationship  between 
them  was  provided  by  a  curve  rather  than  a  straight  line  through 
the  origin. 

In  [100],  King  and  Hurley  described  two  MCP  methods  which 
they  call  the  Windiness  Factor  method  (WF)  and  the  Moulded  Site 
Data  (MSD)  or  WindScale  method.  In  both  methods,  it  is  assumed 
that  the  long-term  wind  direction  at  the  target  site  coincides  with 
its  recorded  short-term  wind  direction. 

In  the  WF  method,  which  uses  a  single  direction  sector,  a  ratio 
is  defined  between  the  long-term  mean  wind  speeds,  v7T,  and 
short-term  mean  wind  speeds,  v,ST,  at  the  reference  site.  This 
procedure,  the  analytic  expression  for  which  can  be  consulted  in 
Eq.  (16),  belongs  to  the  small  group  of  methods  which  do  not 
follow  the  typical  structure  shown  in  Fig.  1.  In  this  case,  the  ratio 
or  relationship  is  established  between  the  data  of  the  reference 
station  and  not  between  the  short-term  data  of  the  target  and 
reference  stations.  Once  this  ratio  has  been  determined,  adjust¬ 
ment  of  each  of  the  short-term  wind  speeds  recorded  at  the  target 
site  is  performed  by  multiplying  them  by  the  windiness  factor, 
Fig.  7.  This  set  of  modified  or  scaled  wind  speeds  represents  the 
long-term  wind  speeds  at  the  target  site: 


(Vj)f  =  (Vj)f 


v? 


(16) 


Using  the  time  series  of  wind  speeds  generated  with  Eq.  (16), 
together  with  the  time  series  of  wind  directions  recorded  at  the 
target  site,  the  authors  propose  the  construction  of  a  table  of 
relative  frequencies,  representing  the  long-term  wind  resource  at 
the  target  site. 

From  the  table  of  frequencies  obtained  with  the  WF  method,  it  is 
possible  to  create  wind  speed  and  direction  histograms  to  which 
continuous  probability  distributions  can  be  fitted  144-145],  Fig.  8. 
Likewise,  a  continuous  probabilistic  representation  can  be  obtained  of 
the  long-term  wind  resource  at  the  target  site  by  fitting  to  the  values 
of  these  tables  a  joint  probability  density  function  of  wind  speed  and 
direction  for  subsequent  wind  energy  analysis  [146],  The  MSD  method 
[100]  resembles  the  WF  method  in  that  the  relationship  is  established 
between  the  short-  and  long-term  data  of  the  reference  station  and 
not  between  the  short-term  data  of  the  target  and  reference  stations, 
as  occurs  in  most  MCP  methods.  Initially,  in  the  procedure  followed  by 
the  MSD  method,  two  table  of  frequencies  are  defined  of  wind  speed 


and  wind  direction  at  the  reference  site,  Fig.  9:  one  table  of  short-term 
frequencies,  and  another  of  long-term  frequencies, (/y)77. 

For  the  construction  of  the  two  tables,  NB  wind  speed  bins  and  13 
wind  direction  sectors  are  defined.  Twelve  sectors  of  30°  (360  /12)  and 
a  thirteenth  sector  which  will  contain  the  wind  speeds  of  the  time 
series  which  are  lower  than  a  specified  cut-off  speed.  That  is,  the 
thirteenth  sector  will  contain  all  the  wind  speeds  lower  than  the  cut¬ 
off  speed  which  would  otherwise  be  in  the  other  12  direction  sectors. 
King  and  Hurley  [100]  use  the  cut-off  speed  3  m/s  when  vf:7  <  6  m/s, 
otherwise  the  cut-off  speed  is  taken  as  4  m/s. 

In  each  of  the  two  tables  of  frequencies  and  for  each  of  the  13 
direction  sectors  (13  columns  of  the  frequency  tables),  the  total 
frequencies  are  calculated,  Eq.  (17),  as  well  as  the  mean  wind 


speeds,  Eq.  (18): 

( f])rT=  '  Z (fj)rT=  '  ZB(f,j)rT 
i  =  1  i  =  1 

(17) 

,ti  ,st  LT  Z 

lr  zrf,a7iJ)f  Jr  zff, 

(18) 

Thirteen  ratios  of  each  parameter  are  determined,  Eq.  (19), 
from  the  values  obtained  with  Eqs.  (17)  and  (18).  In  order  to  lower 
the  error  produced  when  few  data  are  available  for  a  direction 
sector,  if  the  frequency  ratio  is  within  the  range  from  0.2  to  5,  its 
value  is  taken  as  1.  Likewise,  the  wind  speed  ratio  is  taken  as  1  if  it 
is  within  the  range  from  0.33  to  3.  According  to  the  authors,  in  the 
studies  they  carried  out  they  found  that  the  method  was  not 
sensitive  to  the  exact  specified  range 


RVi^Rf]^ 


m 


(fjfr 


(19) 


Each  of  the  variables  k  of  the  short-term  time  series  recorded  at 
the  target  site  is  fitted  by  multiplying  it  by  the  appropriate  wind 
speed  ratio,  Rv(,  and  each  time  interval  (for  example,  1  h)  is 
modified  or  scaled  by  multiplying  it  by  the  corresponding  fre¬ 
quency  ratio,  RfjU: 

(vk)'[r  =  (vuf'RVj;  (tk)['T  =  (tk)frR/j;  (vk)f  e  sector  j  (20) 


12  If  there  are  sufficient  data  for  all  the  sectors,  for  both  the  short-  and  long¬ 
term  period,  then  the  total  time  of  adjusted  hours  will  be  equal  to  the  total  time 
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Fig.  9.  Flow  diagram  created  by  the  authors  to  illustrate  the  procedure  used  by  the  MSD  method. 


Using  the  generated  wind  speed  and  time  data,  together  with 
the  wind  direction  data  recorded  at  the  target  site,  a  table  of  three 
columns  is  constructed,  Fig.  9,  representing  the  long-term  wind 
data  at  the  target  site.  Another  table  can  be  built  from  this  of  long¬ 
term  wind  speed  and  direction  frequencies  at  the  target  site. 

Wind  speed  and  direction  histograms  can  be  configured  from  the 
table  of  long-term  frequencies  obtained  with  the  MSD  method  to 
which  continuous  probability  distributions  can  be  fitted  [144,145]. 

Hanslian  [129]  presented  two  ratios  methods  named  the 
distribution  of  ratios  method  and  the  fingerprinting  method.  Both 
methods  bin  the  wind  speeds  by  the  reference  site  wind  direction. 
As  a  result,  the  first  method  provides  an  artificial  series  of  wind 
data.  The  wind  speed  and  direction  are  estimated  independently. 
The  second  method  does  not  provide  a  time  series  of  data,  but 
rather  frequency  distributions. 

The  methods  of  ratios  described  here  assume  that  the  wind 
direction  at  the  target  site  is  equivalent  to  that  at  the  reference  site. 

4.4.2.  Methods  based  on  first-order  linear  regressions 

By  far  the  most  commonly  used  MCP  methods  in  the  wind 
industry  have  been  based  on  linear  regression  to  characterise  the 


( footnote  continued ) 

before  the  adjustment.  However,  if  any  frequency  ratio  has  been  adjusted  to  1,  as 
described  in  the  main  text,  the  two  total  times  will  not  be  equal. 


*  8760  data  - Quantile  regression  (QR)  (x=0.5) 

- York  Method  - Simple  orthogonal  regression  (SOR) 


-  Ordinary  linear  regression  (OLR) 


Fig.  10.  Results  of  applying  the  OR,  SOR,  QR  and  York  methods  to  the  hourly  mean 
wind  data  recorded  at  10  m  agl  during  2006  at  two  anemometer  stations  installed 
on  the  island  of  Gran  Canaria  (Spain).  The  reference  and  target  stations  are  named 
WS-6  and  WS-5,  respectively,  in  Ref.  [102]. 


J.A.  Carta  et  al.  /  Renewable  and  Sustainable  Energy  Reviews  27  (2013)  362-400 


379 


relationship  between  the  short-term  (ST)  wind  speeds  of  the 
reference  (r)  and  target  (t)  sites  [128],  followed  by  use  of  this 
relationship  for  long-term  (LT)  estimation  of  the  wind  speed  time 
series  at  the  target  site: 

(Vj)tT  =  fi(Vj)rT  +  a  +  £j  (21) 

In  Eq.  (21),  (V|)[T  are  the  estimated  wind  speeds  for  the  target 
site,  (Vj)J:T  are  the  wind  speeds  measured  at  the  reference  site  and 
ft  and  a  are  the  slope  and  offset  of  the  straight  line  which  it  is 
considered  relates  them.  It  is  usually  assumed  that  q  is  white 
noise.13  The  statistical  distribution  of  the  so-called  residuals,  q,  can 
be  estimated  by  using  the  short-term  data,  Eq.  (22).  These 
residuals  should  be  Gaussian14  and  can  be  added  to  Eq.  (21), 
estimating  them  with  a  random  number  generation  method,  such 
as  the  Monte  Carlo  method  [147]’15: 

£j  =  (vj)V~  VKVj^  +  a]  (22) 

If,  when  using  Eq.  (21),  the  predicted  wind  speed  is  negative 
this  speed  is  taken  as  zero. 

As  described  in  Section  4.2,  with  the  idea  of  taking  into 
consideration  the  effects  of  wind  direction  the  data  are  usually 
binned  or  grouped  in  wind  direction  sectors  and  the  parameters  of 
Eq.  (21)  are  determined  for  each  of  the  different  sectors. 

In  order  to  determine  the  parameters  or  linear  coefficients  (ft, a) 
of  fit  of  the  regression  line,  Eq.  (21),  use  has  fundamentally  been 
made  in  the  methods  that  have  been  proposed  of  the  ordinary 
linear  regression  (OLR)  [13,25-26,28,61,65,74,76,99,103,105-106, 
122-124,126-128]  and,  to  a  lesser  extent,  the  orthogonal  regres¬ 
sion  (OR)  [25-26,28,65,76,128]  and  quantile  regression  (QR)  [28], 

In  the  OLR,  the  parameters  ft  and  a,  Eq.  (21),  which  best  fit  a 
straight  line  to  the  data,  are  determined  by  least  squares.  Given 
that  a  and  ft  depend  on  wind  speed  and  direction,  n  in  Eq.  (23)  is 
the  number  of  wind  speeds  in  the  direction  sector  of  the  reference 
site  under  analysis. 

The  aim  of  the  OLR  is  to  reduce  to  a  minimum  the  sum  of  the 
squares  of  the  vertical  distances  between  the  values  of  the 
dependent  data  (observed  wind  speeds  at  the  target  site)  and 
the  corresponding  values  as  predicted  by  the  fitted  line  [66,67], 
Fig.  10.  The  OLR  assumes  that  the  independent  variable  (reference 
site  wind  speed)  is  error  free  (that  is,  the  values  of  the  dependent 
variable  are  known  exactly)  and  that  only  the  dependent  variable 
(target  site  wind  speed)  has  an  error  component  [148],  The  basic 
conditions  or  fundamental  hypotheses  for  this  model  are  that  the 
errors  are  homoscedastic  variables  (the  errors  have  the  same 
variance  in  all  the  observations  of  the  endogenous  variable)  that 
the  errors  have  a  normal  distribution  with  zero  mean  and  that  the 
pairs  of  sample  observations,  [(v,)t,  (Vj)r[,  are  independent  of  each 
other,  which  will  mean  that  the  covariance  of  the  errors  is  zero 
(Vi,j,¥j): 

/;  =  Ij’=  ityF  (Vjir-q/nlZjL  ftvjf  ZjQ  =  (sVrVt)ST 

ZU  r(v?)f-(l/n)  [ij ’=  i(Vj)f  ]2  {sl)ST 


13  According  to  Thogersen  et  al.  [24],  in  many  cases  the  data  show  that  a  better 
assumption  consists  of  modeling  the  residuals  conditioned  on  both  wind  speed  and 
direction.  This  model  is  available  in  WindPRO,  where  the  residuals  are  modeled  as 
Gaussian,  but  with  the  mean  and  standard  deviation  conditioned  on  the  wind 
speed  and  wind  direction. 

14  The  distribution  of  the  residuals  should  be  examined  for  the  purpose  of 
checking  the  model.  The  type  of  examination  almost  always  comprises  easy  to 
construct  graphs  which  are  generally  very  revealing  when  the  assumptions  are 
violated  [66]. 

15  Consideration  of  the  residuals  is  important  because  wind  energy  varies  with 
the  cube  of  the  wind  speed  and,  therefore,  the  positive  residuals  contribute  much 
more  to  increasing  the  energy  than  the  negative  residuals  contribute  to  lowering  it. 


a  =  vfr-ftv^T  (23) 

In  Eq.  (23),  (Vj),7  and  (vftf  are  the  observed  short-term  (ST) 
wind  speeds  at  the  reference  and  target  site,  respectively,  n  is  the 
number  of  wind  speed  data,  sVr„t  is  the  covariance  between  (vftf 
and  ( Vj)f T,  and  s2r  is  the  variance  of  (Vj)srT. 

Given  that  the  independent  variable  is  also  measured  and  is 
therefore  subject  to  errors,  the  use  of  orthogonal  regression  has 
been  proposed  [28,65,76,128]  as  OR  assumes  that  both  the 
dependent  and  independent  variables  have  associated  errors,  evt 
and  £-vr. 

The  aim  in  OR  is  to  reduce  to  a  minimum  the  orthogonal 
distances  (perpendiculars)  of  the  data  points  to  the  fitted  line 
[148-150].  The  linear  model  that  results  is  known  as  the  Major 
Axis  [151]. 

As  shown  by  Fuller  [149],  in  the  general  orthogonal  regression 
method  (GOR),  if  the  ratio  2,  Eq.  (24),  between  the  variances  of  the 
errors  of  both  variables  is  known  from  independent  information, 
the  best  estimators  of  the  coefficients  ft  and  a  are  those  given  by 
Eq.  (25): 

2  =  Sf‘  (24) 

swr 


(s2t)S7'-2(s2)S7'  +  0(s2  F-XtsiF?  +4X(slVr)ST 
/y“  2(sVtVr)ST 

a  =  v^r-ftv^T  (25) 

In  Eq.  (25),  s2  is  the  variance  of  (Vjjf1. 

In  the  simple  orthogonal  regression  method  (SOR),  it  is 
assumed  that  both  variables  have  the  same  uncertainty  (2=1). 
When  X  cannot  be  reliably  evaluated,  Castellaro  and  Bormann 
[152]  show  that,  under  reasonable  assumptions,  the  SOR  (equiva¬ 
lent  to  the  GOR  with  2=1)  performs  better  than  the  OLR,  Fig.  10. 

Carroll  and  Rupper  [153]  argue  that  the  slope,  ft,  is  often 
overestimated  when  ignoring  the  error  of  the  equation,  <TeEq. 
So,  they  say  that  use  of  OR  should  include  a  careful  assessment 
of  the  error  of  the  equation,  and  not  just  the  customary  (often 
informal)  estimation  of  the  ratio  of  variances  of  the  measurement 
errors  of  Eq.  (27).  That  is,  they  propose  the  use  of  Eq.  (26)  instead 
of  Eq.  (24): 


2  = 


eEq 


(26) 


When  only  the  2  relationship  is  known,  the  GOR  represents  the 
simplest  and  most  widely  used  procedure.  However,  there  are 
other  OR  methods,  such  as  the  chi-square  regression  (CSQ)  and  the 
weighted  total  least  squares  (WLS)  [148,154-155]  which  are 
preferable  when  the  uncertainty  of  each  observation  is  known 
[148],  As  its  name  indicates,  the  CSQ  is  based  on  minimisation  of 
the  chi-square  statistic,  /2,  defined  as  the  sum  of  the  squares  of  n 
random  residuals,  normalized  by  their  variances: 


X2(«,P)  = 


"  [(V,)f-a-/?(V,)f] 

1  =  1  (sjf  +  ft2(sj)V 


(27) 


where  (sf)^T  and  (Sj)fr  are,  respectively,  the  standard  deviations  at 
the  ith  point  of  the  short-term  wind  speeds  of  the  reference  and 
target  sites,  respectively. 

In  1966,  York  [150]  published  an  article  showing  how  to 
perform  the  fit  of  the  straight  line  to  the  data.  York  [150]  showed 
that  the  equations  could  be  resolved  analytically.  However,  the 


16  The  methods  presented  in  [154.T55]  are  based  on  minimization  of  the  chi- 
square  statistic,  but  the  formulation  differs  from  that  indicated  in  Eq.  (27).  Krystek 
and  Anton  [155]  proposed  a  model  for  when  the  uncertainties  of  the  two  variables 
are  correlated. 
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analytical  solution  of  the  equations  is  based  on  an  assumption 
with  respect  to  the  slope  of  the  straight  line,  generally  taking  as 
the  slope  that  obtained  with  the  Major  Axis  method.  York  [150] 
suggests  the  use  of  an  iterative  routine  to  obtain  the  best  estima¬ 
tion  of  the  slope  f).  Fig.  10.  York's  method  has  been  implemented  in 
various  software  applications  used  in  the  wind  industry  [26,65,76] 
that  have  MCP  modules.  Ref.  [75]  contains  a  routine  in  Fortran  77 
which  facilitates  calculation  of  the  regression  line  using  York's 
method.  According  to  Rebbeck  [55],  RES  (Renewable  Energy 
Systems  Ltd.)  claimed  that  its  method  was  equivalent  to  York's. 
However,  this  is  not  true,  as  RES  uses  an  empirical  technique 
which  does  not  match  the  mathematical  rigour  of  York's  method. 

As  stated  above,  the  aim  in  classical  regression  methods  is  to 
minimise  the  sum  of  the  squares  of  the  residuals  and  use  the  mean 
as  estimator.  However,  quantile  regression,  introduced  by  Koenker 
and  Bassett  [156],  is  not  based  on  assumptions  as  restrictive  as 
those  of  classical  linear  regression  and  aims  to  model  the  relation¬ 
ship  that  exists  between  the  dependent  and  independent  variable 
for  different  quantiles  (median  or  any  other  quantile)  of  the 
distribution  of  the  dependent  variable.  As  a  result,  QR  is  able  to 
provide  a  much  more  complete  statistical  analysis  of  the  relation¬ 
ships  between  random  stochastic  variables  [157],  Estimation  of  the 
parameters  in  the  case  of  quantile  regression  is  carried  out  by 
minimisation  of  the  asymmetrically  weighted  absolute  deviations, 
Eq.  (28),  where  t  is  the  quantile  (0<t<1).  As  a  result  of  the 
structure  of  QR,  estimation  of  the  parameters  (a„/?t)  cannot  be 
reduced  to  a  numerical  linear  algebra  problem,  as  in  the  case  of 
the  minimum  squares  method,  Fig.  10.  In  this  case,  a  linear 
programming  problem  is  involved  which  can  be  solved  by  the 
simplex  method  [75], 


(a„Pt)  =  argmin 

aT,(}TeRn 


2  TlWf-ar-^VOfl 


+  2  Cl-T)\(Vif-at-pt(Vif\  l  (28) 

(v,-)f  <  OK-PAVi)?  J 

It  should  be  mentioned  that  among  the  above-described 
ordinary  regression,  orthogonal  regression  and  quantile  regression 
models  with  two  parameters  (a,/?,),  there  are  models  that  have 
been  proposed  which  are  conditioned  on  the  straight  line  passing 
through  the  offset  [2,25,28,65,98,105].  That  is,  the  offset  ( a )  of  the 
regression  line  has  a  zero  value.  So,  a  perfect  correlation  is 
considered  to  exist  between  the  data  of  the  target  and  reference 
stations.  According  to  Joensen  et  al.  [132],  from  a  physical  point  of 
view  it  is  not  logical  to  include  the  offset,  as  this  implies  that  the 
wind  speed  of  the  target  site  will  not  be  zero  when  the  wind  speed 
at  the  reference  site  is  zero.  However,  in  terms  of  the  mean  square 
error,  performance  increases  when  the  offset  is  included,  as  the 
model  is  only  approximate.  According  to  Jain  [2],  though  regres¬ 
sion  through  (0,0)  is  intuitive,  it  does  not  provide  good  represen¬ 
tation  for  the  higher  wind  speeds,  which  are  precisely  the  winds 
which  result  in  energy  production.  King  and  Hurley  [98]  consider 
that  a  straight  line  through  the  origin  does  not  provide  a  good 
model  for  relating  wind  speed  distributions.  Thogersen  et  al.  [24] 
report  that  the  MCP  module  of  the  WinPro  software  application 
can  force  the  regression  line  to  pass  through  the  origin  (0,0). 
However,  they  state  that  this  option  should  only  be  used  with  care, 
as  in  general  it  gives  a  significantly  poorer  fit  of  the  data  than 
methods  in  which  a  non-zero  intersection  with  the  Y-axis  is 
permitted. 

Beltran  et  al.  [99]  propose  replacing  the  linear  regression 
method  with  a  linear  algorithm  which  they  called  the  bin  or  bin¬ 
fitting  method  [158],  as  it  is  based  on  the  standard  procedure  used 
in  measurement  of  the  power  curve  of  a  wind  turbine  [159], 


In  the  proposed  algorithm  [99],  the  wind  speed  data  are 
grouped  into  bins  of  wind  speed  and  wind  direction  sectors.  In 
this  case,  the  wind  speeds  of  the  target  station  are  binned  versus 
the  binned  measured  wind  speeds  of  the  reference  station  in 
ranges  of  0.5  m/s.  In  each  bin  b,  where  the  number  of  data  must  be 
higher  than  10,  the  means  of  the  wind  speeds  of  the  reference  and 
target  stations  are  calculated.  A  set  of  points  is  thus  obtained  in  a 
Cartesian  system  in  which  the  x-axis  represents  the  wind  speed  at 
the  reference  station  and  the  Y-axis  the  wind  speed  at  the  target 
station.  Then,  by  means  of  linear  interpolation  between  these 
points,  the  following  equation,  the  wind  speeds  at  the  target 
station  are  estimated: 


(Vi)f  =  (Vb)f  +  [(Vi){T-(Vll)f] 


(vt+ilT-fytlT 

(v6+1  )f-(vh)fT 


(29) 


where  (vf)[T  is  the  long-term  wind  speed  at  the  target  station, 
(vb)f  is  the  short-term  mean  wind  speed  of  bin  b  at  the  target 
station,  (vb)fT  is  the  short-term  mean  wind  speed  of  bin  b  at  the 
reference  station  and  (v,)fT  is  the  long-term  wind  speed  pertaining 
to  the  range  included  between  the  mean  speeds  of  bins  b  and  b+1 
at  the  reference  station. 

In  those  cases  in  which  the  long-term  reference  station  wind 
speeds  are  greater  than  the  highest  bin  of  the  interpolation, 
estimation  of  the  long-term  target  station  wind  speed  is  carried 
out  through  a  linear  function.  The  authors  propose  that  this  linear 
function  be  obtained  through  a  fit  to  the  set  of  interpolation  points 
[(Yft)?7'  (vb )fT],  excluding  data  below  3.25  m/s,  Fig.  11. 

LeBlanc  et  al.  [128]  proposed  a  method  similar  to  the  bin 
method,  which  they  call  the  Vertical  Slice  method.  The  wind  speed 
domain  is  split  into  vertical  bands  or  slices  and  a  piecewise  linear 
fit  is  performed  of  the  mean  of  the  dependent  variable  of  each 
slice  or  bin.  If  the  number  of  data  in  a  particular  bin  or  slice  is  too 
low  to  obtain  the  mean  of  the  dependent  variable,  then  the  fit  is 
performed  using  a  standard  linear  regression.  This  method  has 
been  implemented  in  the  WindoGrapher  software  [25], 

King  and  Hurley  [98]  have  proposed  three  MCP  methods, 
named  SpeedSort,  DynaSort  and  Scatter.  These  have  been  incor¬ 
porated  in  the  set  of  MCP  methods  used  by  WindoGrapher  [25], 


Fig.  11.  Results  of  applying  the  method  of  bins  to  the  hourly  mean  data  recorded  at 
10  m  agl  during  2010  at  two  anemometer  stations  (Airport  and  ITC)  installed  on  the 
island  of  Gran  Canaria  (Spain). 
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Fig.  12.  Results  of  applying  the  SpeedSort  method  to  the  hourly  mean  data 
recorded  at  10  m  agl  during  2010  at  two  anemometer  stations  (Airport  and  ITC) 
installed  on  the  island  of  Gran  Canaria  (Spain). 


Wind  speed  at  the  airport  (m/s) 


The  algorithm  of  the  SpeedSort  method,  in  an  initial  stage, 
assigns  each  wind  speed  of  the  time  series  representing  the  short¬ 
term  period  to  one  of  the  standard  twelve  30°  direction  sectors 
defined  by  the  reference  site  wind  direction.  The  wind  speeds  of 
the  reference  site  below  the  calm  threshold  (1  m/s)  are  randomly 
distributed  among  the  direction  sectors.  In  each  sector,  the  target 
and  reference  wind  speeds  are  independently  ordered  (in  ascend¬ 
ing  or  descending  sequence),  Fig.  12.  In  this  respect,  the  SpeedSort 
method  displays  a  certain  similarity  to  the  method  of  bins  [99].  In 
a  second  stage,  two  linear  relationships  are  proposed  in  principle 
to  characterise  in  each  direction  sector  the  relationship  between 
the  ordered  short-term  wind  speeds  of  the  target  and  reference 
sites.  Each  of  these  linear  relationships  represents  a  range  of  wind 
speeds  of  the  reference  site.  These  lines  of  fit  try  to  estimate  the 
relationship  that  exists  between  the  wind  speed  frequency  dis¬ 
tributions  of  the  two  sites  as  opposed  to  the  relationship  between 
values  of  the  time  series  of  their  wind  speeds. 

The  authors  [98]  propose  using  a  reference  site  cut-off  speed, 
^ cut-off  *  to  establish  the  separation  of  the  two  datasets  which  will 
be  represented  by  these  linear  functions,  which  they  take  as  the 
lower  value  of  vJ:r/2  and  4  m/s.17  They  use  a  linear  regression  of 
parameters  a  and  p,  Eq.  (24), 18  to  characterise  wind  speeds  higher 
than  the  cut-off  speed  in  a  given  sector  if  the  number  of  data  in 
that  sector  at  least  reaches  a  critical  value  which  depends  on  the 
correlation  coefficient  between  the  target  and  reference  site  wind 
speeds  for  the  same  sector.  If  this  condition  is  not  met,  but  there  is 
at  least  an  established  minimum  number  of  data,  then  the  authors 
propose  using  a  straight  line  which  passes  through  the  origin  and 
the  centroid  of  the  data.  If  this  final  restriction  is  not  met  either, 
they  propose  using  a  straight  line  which  passes  through  the  origin 
and  which  has  a  slope  of  45°.  That  is,  in  this  last  case,  the  wind 
speeds  at  the  target  site  are  considered  to  be  the  same  as  those  at 
the  reference  site. 

To  characterise  the  relationship  between  wind  speeds  lower 
than  the  cut-off  point,  a  straight  line  is  proposed  which  starts  from 
the  lower  end  of  the  straight  line  obtained  with  Eq.  (21)  and  which 


17  According  to  the  authors  [98],  these  values  were  chosen  with  the  idea  of 
optimizing  the  fit  of  the  line  to  the  range  of  wind  speeds  which  is  of  interest  from 
the  point  of  view  of  a  wind  turbine's  electricity  production. 

18  If  the  linear  relationships  estimate  a  negative  long-term  wind  speed  at  the 
target  site,  this  wind  speed  is  assigned  a  value  of  zero. 


Fig.  13.  Results  of  applying  the  DynaSort  method  to  the  hourly  mean  data  recorded 
at  10  m  agl  during  2010  at  two  anemometer  stations  (Airport  and  ITC)  installed  on 
the  island  of  Gran  Canaria  (Spain). 


passes  through  the  origin.  That  is,  a  straight  line  of  zero  offset  and 
slope  p2. 
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To  estimate  the  parameters  a  and  p  of  the  regression  line,  the 
authors  [98]  propose  using  a  variation  of  the  OLR.  In  the  descrip¬ 
tion  of  the  procedure,  instead  of  minimising  the  sum  of  the 
squares  of  the  distances  perpendicular  to  the  x-axis,  they  minimise 
the  sum  of  the  squares  of  the  distances  perpendicular  to  the  line 
that  passes  through  the  origin  and  centroid  of  wind  speeds  higher 
than  the  cut-off  speed  of  the  reference  site.19 

According  to  the  authors,  SpeedSort  is  better  than  the  EWEA 
method  in  terms  of  representation  of  wind  speed  frequencies. 
However,  they  admit  that  a  straight  line  generally  does  not 
perfectly  represent  the  distributions  and  therefore  gives  rise  to 
scatter.  In  this  method,  an  analysis  is  also  undertaken  by  the 
authors  [98]  of  the  differences  between  the  wind  direction  of  the 
target  and  reference  sites  for  each  data  of  the  time  series,  and  the 
long-term  wind  direction  at  the  target  site  is  predicted  as 
explained  in  Section  4.6  of  this  review.  The  output  of  results 
generated  by  the  algorithm  of  the  SpeedSort  method  comprises  a 
table  of  relative  frequencies,  representing  the  long-term  wind 
speeds  and  directions  at  the  target  site. 

The  DynaSort  method  differs  from  SpeedSort  in  three  main 
ways.  First,  DynaSort  uses  a  number  N  of  direction  sectors  which  is 
not  fixed"1  but  is  dependent  on  the  number  of  data  filtered  in  a 
previous  process  and  whose  bounds  are  chosen  by  ensuring  that 
all  the  sectors  have  an  equal  number  of  data.21  The  algorithm  of 


19  According  to  the  authors  [98],  there  are  two  advantages  to  this  procedure: 

(a)  it  gives  the  same  line  independently  of  which  site  is  on  the  vertical  axis  and 

(b)  any  scattering  will  tend  to  pull  the  regression  line  towards  the  line  that  passes 
through  the  origin  and  centroid  rather  than  towards  a  horizontal  line  that  passes 
through  the  centroid. 

20  For  convenience  sake,  they  limit  this  number  to  24. 

21  The  authors  [98]  have  programmed  the  algorithm  of  this  method  to  solve 
cases  in  which  there  are  surplus  data. 
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Fig.  14.  Flow  diagram  created  by  the  authors  to  facilitate  interpretation  of  the  first  block  of  tasks  of  the  algorithm  followed  by  the  scatter  method. 


the  DynaSort  method  proceeds  in  a  similar  way  to  that  of 
SpeedSort,  except  that  instead  of  assigning  each  wind  speed  of 
the  time  series  representing  the  short-term  period  to  one  of  12 
fixed  direction  sectors,  it  assigns  it  to  one  of  N  dynamically 
calculated  direction  sectors  defined  by  the  reference  site  wind 
direction.  Second,  the  DynaSort  method,  in  addition  to  indepen¬ 
dently  sorting  in  each  sector  of  each  site  all  the  wind  speeds  (that 
is,  without  carrying  out  a  prior  filtering),  also  sorts  the  wind 
directions  (though  in  this  case  using  filtered  data)  at  the  reference 
and  target  sites  for  subsequent  prediction  of  the  long-term  target 
site  wind  direction,  as  explained  in  Section  4.6.  As  occurs  with  the 
wind  speeds,  one  consequence  that  results  from  sorting  the  wind 
directions  is  that  the  relationships  that  are  established  are  based 
on  wind  direction  frequency  distributions  instead  of  being  based 
on  time  series  of  the  wind  directions. 

The  third  difference  is  seen  in  that  the  algorithm  of  the 
DynaSort  method  does  not  fit  straight  lines  to  the  n  sorted  wind 
speeds  in  each  sector.  In  this  method,  the  algorithm  tries  to  reduce 
the  amount  of  variation  present  in  the  set  of  ordered  wind  speed 
data  by  smoothing  it  out.  For  this  purpose,  the  concept  of  moving 
averages  is  employed  [67]  with  a  span  of  M  data,  Eq.  (31),  and  the 
trend  line  is  then  used  to  represent  the  relationship  between  the 
long-term  target  and  reference  site  wind  speeds,  Fig.  13: 


<v*+i) t  = 


y  M+k  (v.)51 

ST  _  ‘-“J  -  1  I  k'  VJ 't 


M 


yM+k  ( 

.  \ST  Zjj=t+kSvJ>r 

>  !vk+llr  — 


M 


k  =  0...n—M 


(31) 


If  the  long-term  reference  site  wind  speed  is  within  the  range 
covered  by  the  smoothed  curve,  the  long-term  target  site  wind 
speed  is  estimated  from  this  curve,  interpolating  if  necessary.  One 
of  the  drawbacks  of  this  technique  of  moving  averages  is  that  the 
data  of  the  start  and  end  of  the  sorted  series  are  lost.  So,  the 
authors  [98]  propose  taking  a  straight  line  that  goes  from  the  first 
moving  average  to  the  offset  to  estimate  the  long-term  target  site 
wind  speed  if  the  long-term  reference  site  wind  speed  is  lower 
than  the  range  covered  by  the  smoothed  curve.  However,  if  the 
long-term  reference  site  wind  speed  is  higher  than  the  range 
covered  by  the  smoothed  curve,  the  authors  [98]  propose  using  a 
straight  line  that  commences  at  the  last  moving  average  and  has 


a  slope  /13,  Eq.  (32),  to  estimate  the  long-term  target  site  wind 
speed.  This  slope  is  defined  by  the  bisector  of  the  angle  formed 
between  a  straight  line  that  passes  through  the  origin  and  the  last 
moving  average  and  a  straight  line  of  45°  slope: 
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According  to  the  authors  [98],  the  results  of  the  studies  they 
carried  out  show  that  the  DynaSort  method  was  slightly  less 
accurate  than  SpeedSort  for  periods  below  one  year,  though 
DynaSort  did  tend  to  improve  its  performance  compared  to 
SpeedSort  as  the  length  of  the  short-term  period  grew.  When 
the  short-term  period  is  not  very  long  the  irregularities  in  the 
adjusted  curve  are  perhaps  more  due  to  random  wind  variations 
between  the  sites  than  to  any  long-term  effect.  This  suggests  to  the 
authors  [98]  that  precise  representation  of  wind  speed  frequency 
distributions  requires  a  short-term  period  that  covers  at  least  one 
complete  year  of  data. 

Conceptually  speaking,  the  Scatter  method  differs  from  Speed- 
Sort  and  DynaSort  in  that  in  its  procedure  the  long-term  reference 
site  data  are  modified  to  obtain  the  long-term  target  site  data  by 
using  single  data  of  the  time  series  of  the  short-term  period  of 
both  sites.  The  algorithm  of  the  Scatter  method  contains  two 
blocks  of  differentiated  tasks.  In  the  first  block,  the  data  of  the 
period  corresponding  to  the  short-term  period  are  distributed, 
based  on  the  reference  site  wind  direction,  among  the  N  dynami¬ 
cally  chosen  direction  sectors22  which  are  determined  according  to 
the  total  number  of  data  of  the  period  and  whose  bounds  are 
defined  by  ensuring  that  these  sectors  have  an  equal  number  of 
data,23  Fig.  14.  Then,  the  wind  speeds  are  independently  sorted  in 
ascending  or  descending  order  in  all  the  sectors  of  both  sites. 

In  the  second  block,  Fig.  15,  the  data  of  the  time  series  of  the 
long-term  wind  speeds  and  directions  are  processed.  For  each  data 
of  the  time  series  of  the  long-term  reference  site  wind  speeds, 


22  The  wind  speeds  of  the  reference  site  below  the  calm  threshold  are 
randomly  distributed  among  all  the  direction  sectors. 

23  The  authors  [98]  programmed  an  algorithm  different  to  that  introduced  in 
the  DynaSort  method  to  solve  cases  in  which  there  are  surplus  data. 
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Fig.  15.  Flow  diagram  created  by  the  authors  to  facilitate  interpretation  of  the  second  block  of  tasks  of  the  algorithm  followed  by  the  scatter  method. 


(Vj)J:Tt  the  algorithm  that  corresponds  to  this  block  performs  a  set 
of  routines.  The  direction  sector,  (0k)r,  is  determined  from  among 
the  sectors  defined  in  the  first  block  to  which  it  must  belong  as  a 
function  of  the  wind  direction,  (d,)27,  associated  to  the  wind  speed, 
(Vj)j:T,  in  the  time  series.  The  ten  closest  wind  speeds  to  (v,)P  are 
located  from  the  set  of  reference  station  wind  speeds  of  this  sector, 
( 0k)r ,  in  the  short-term  period  and  one  of  them,(V;)fT,  is  randomly 
selected.  Following  this,  the  corresponding  wind  speed, (v,)fr, 
in  sector  (6k)t  of  the  target  site  is  then  taken.  The  next  line  of 
the  routine  estimates  the  long-term  target  site  wind  speed 
through  the  following  equation24: 

(v,)f  =  (v,)f  +  [(v,)f  -(VitW  (33) 

In  Eq.  (33),  p  is  the  slope  of  a  straight  line,  the  parameters  of 
which  are  determined  through  a  linear  regression,  Eq.  (21),  which 
in  this  case  characterises  the  relationship  between  the  wind 
speeds  of  all  the  target  site  sectors  in  function  of  the  short-term 
wind  speeds  of  all  the  reference  site  sectors.  The  output  of  results 
generated  by  the  algorithm  of  the  Scatter  method  consists  of  a 
table  of  relative  frequencies,  representing  the  long-term  target  site 
wind  speeds  and  directions.  For  the  construction  of  this  table  the 
algorithm  uses  the  wind  speed  bins  versus  the  standard  12  wind 
direction  sectors. 

Rogers  et  al.  [106]  propose  using  the  so-called  variance  ratio 
method,  Eq.  (34),  to  predict  the  long-term  wind  speed  at  a  target 
site: 


(yjf= 


v?-(fr)v? 
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The  slope  and  offset  of  the  linear  relationship  given  by  Eq.  (34) 
are  determined  by  Rogers  et  al.  [106]  forcing  the  mean  and 
variance  of  the  estimated  long-term  wind  speeds  at  the  target 
site,  during  the  short  concurrent  data  period,  to  coincide  with  the 
mean  and  variance  of  the  observed  wind  speeds  at  the  same  site. 


24  If  the  estimated  wind  speed  is  negative,  it  is  set  to  zero. 


This  mean  that  the  square  of  the  slope  p  of  the  regression  line 
coincides  with  the  ratio  between  the  variances  of  the  target  site 
and  reference  site  wind  speeds  in  the  concurrent  data  period. 
Rogers  et  al.  [106]  showed  that  the  variance  ratio  method  gives 
better  results  than  ordinary  linear  regression  when  predicting 
mean  wind  speed.  As  in  the  methods  mentioned  above  based  on 
first-order  linear  regressions  with  non-zero  offset,  if  the  wind 
speeds  predicted  with  Eq.  (34)  are  negative  then  they  are  con¬ 
sidered  to  have  zero  value.  This  method  has  been  implemented  in 
the  MCP  module  of  the  WindoGrapher  software  [25], 

Achberger  [160]  proposes  decomposing  the  reference  and 
target  site  wind  speeds  with  two  orthogonal  axes  (X,Y): 

vx  =  v  sin(d) 

(35) 

vy  =  v  cos  (a) 

where  v  is  the  magnitude  of  the  observed  wind  speed,  d  is  the 
wind  direction  and  vx  and  vy  are  the  Cartesian  wind  components. 
West-east  and  south-north,  respectively,  are  usually  taken  as  the 
positive  directions  of  the  X-  and  V-axes  [145], 

Two  independent  linear  regressions  are  then  performed;  one 
regression  between  the  components  on  the  X-axis  of  the  reference 
and  target  site  short-term  wind  speeds,  and  another  regression 
between  the  same  components  on  the  Y-axis.  Based  on  the 
regressions  performed  between  the  wind  speed  components,  the 
slopes  ( px,py )  and  offsets  ( ax,ay )  are  determined  and  the  compo¬ 
nents  of  the  long-term  target  site  wind  speeds  are  estimated: 

[( Vjhi7  =  MtydP  +  +  exJ  (36) 

[(vj)y]P  =  /fy[(vj  )y]p  +  ay  +  eyj  (37) 

In  a  final  step,  the  long-term  target  site  wind  speeds  are 
estimated  by  combining  their  components: 

(Vj)tT  =  {[[(Vj-y27']2  +  [[(v,.)y]P]2}V2  +  ej  (38) 

It  can  be  deduced  from  Eq.  (38)  that  the  wind  speeds  predicted 
with  this  method  cannot  take  negative  values. 
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Nielsen  et  al.  [74]  propose  use  of  a  vector  linear  relationship. 
Using  this  model,  the  components  (in  a  system  of  X,  Y  orthogonal 
coordinates)  of  the  long-term  target  site  wind  speeds  are  obtained 
through  a  multilinear  regression,  Eq.  (39).  This  method  is  referred 
to  as  the  “vector"  model  74]  or  “vector”  method  [106]: 
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The  coefficients  /Jy  and  ay  are  determined  through  the  matrix 
product  shown  in  the  following  equation: 
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The  modules  of  each  of  the  estimated  long-term  target  site 
wind  speeds  are  determined  through  Eq.  (38). 

Achberger  [160]  also  proposes  a  model  based  on  a  vector 
regression  technique  [161],  First,  the  model  considers  the  two 
horizontal  components  of  the  wind  speed  as  the  real  and  imagin¬ 
ary  parts  of  a  complex  number: 

vt  =  (vx)t  +  i(vy)t 

vr  =  (Vx)r  +  i(Vy)r  (41) 

where  i  =  V— 1  and  the  variances  and  covariance  are  defined  by 
the  following  equations,  respectively: 

(Svr)2  =  [s(vx)r]2  +  [S(vy)r]2  (42) 


(sv,)2  —  [s(v„)t]2  +  [s(vy)t]2 


(43) 


SVrVt  —  [S(Vj)r(Vx)t  +  S(Vy)r(Vy)t]  +  i[S(V*)r(Vy)t  — S(Vy)r(VX)t]  (44) 

Using  this  model,  the  long-term  target  site  wind  speeds  are 
obtained  through  the  following  equation: 

(Vj)F  =  [(vJ)x#r  +  ![(Vj)y]f 

=  (ax  +  fay )  +  (fa  +  ipy){[(vj)x]trT  +  *[(  V))y]r^l  (45) 


The  parameters  fa  and  fa  are  determined  through  Eqs.  (46)  and 


(47),  respectively: 

fa  =  (Pvrv,)x^L 

^Vr 

(46) 

Py  =  (Pv,vt)ySiL 

^Vr 

(47) 

In  Eqs.  (46)  and  (47),  (pVrVt)x  and  {pVrVt)y  constitute  the  real  and 
imaginary  components  of  the  correlation  vector  pVrVt.  These 
components  of  the  correlation  vector  are  given  by  Eqs.  (48)  and 
(49),  respectively: 
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The  parameters  ax  and  ay  are  determined  through  Eq.  (50)  and 


Eq.  (51),  respectively: 

ax  =  [vx]f-^[vx]f  +fa[Vy\f 

(50) 

ay  =  [fa]? -fa(vy]f -fa(vx]? 

(51) 

The  modules  of  each  of  the  estimated  long-term  target  site 
wind  speeds  are  determined  through  Eq.  (38). 

All  the  linear-regression-based  methods  mentioned  above 
assume,  with  the  odd  exception  [74,160]  that  the  target  site  wind 
direction  is  equivalent  to  that  of  the  reference  site.  However, 
Woods  and  Watson  [101  ]  point  out  that  there  are  places  where,  for 
example,  the  complexity  of  the  terrain  often  makes  invalid  the 
assumption  that  there  exists  a  one-to-one  correspondence 
between  the  reference  and  target  site  wind  direction  sectors.  In 
this  context,  the  authors  propose  a  general  approach  which  they 
call  the  matrix  method  that  allows  to  take  into  consideration  the 
possibility  that  the  reference  site  wind  rose  is  different  to  that  of 
the  target  site.  First,  in  the  methodology  they  propose,  a  matrix  E 
(N  x  N)  is  created  with  the  wind  data  binned  in  N  direction 
sectors  of  the  reference  and  target  sites  during  the  concurrent  data 
period.  An  element  ey  of  the  matrix  E  contains  the  number  of 
times  that  the  wind  has  blown  simultaneously  in  sector  i  of  the 
target  site  and  in  sector  j  of  the  reference  site,  Table  1.  Subse¬ 
quently,  those  elements,  ey,  are  discarded  which  comprise  a  small 
fraction,  S,26  of  the  total  number  of  data  recorded  in  each  sector. 
So,  the  work  restarts  with  a  new  matrix  E'  whose  elements  e'y  are 
null  or  coincide  with  those  of  matrix  E  when  the  restriction  given 
by  the  following  equation  is  met,  Table  2: 

e‘j  100  >  ig->  e'y  —  ey-;  i  =  j  =  i...N  (52) 

Xk  =  1  ei.k 

From  E',  a  new  matrix  Z(NxN)  is  constructed  whose  elements 
Zy  are  expressed  as  a  percentage  of  the  total  number  of  data  of 
each  sector  i  of  the  target  site,  Eq.  (53),  such  that  Eq.  (54)  is  met  for 
each  row  (sector)  i,  Table  3. 

zU=vl2e'J  ,  10°:  i=1...N;j=i...N  (53) 

Xfc  =  1  e  kj 

X  zij  =  1  (54) 

j=l 

Once  the  matrix  Z  has  been  constructed,  there  are  two 
approaches  to  predict  the  long-term  wind  speeds  in  each  sector  i 
of  the  target  site. 

In  one  of  the  approaches,  the  mean  wind  speed  in  each  bin  of  a 
particular  direction  sector  is  considered  to  be  equivalent  to  the 
overall  mean  of  the  wind  speed  of  that  sector.  In  addition,  it  is 
assumed  that  the  regression  line  which  is  obtained  with  the  wind 
speeds  of  a  sector  can  be  used  in  each  of  the  individual  bins  of  the 
sector.  In  this  approach,  the  mean  long-term  wind  speeds,  (v,)[T, 
are  estimated  for  each  sector  i  of  the  target  site  from  the  mean 
long-term  wind  speeds,  (Vj)j:T,  of  sectors  j  of  the  reference  station 
by  means  of  the  weighting  of  first-order  linear  regressions 
obtained  for  the  direction  sectors  j,  Eq.  (55).  The  weighting 
coefficients  are  composed  of  the  elements  Zy  of  the  matrix  Z. 
fa  and  aj  in  Eq.  (55)  are  the  slope  and  offset,  respectively,  of  the 
straight  line  that  characterises  the  relationship  between  all  the 
e'y  (i  =  1...N)  short-term  wind  speed  data  of  the  reference  site  (r) 


25  In  their  work,  the  authors  use  12  direction  sectors  (N=12). 

26  The  authors  state  that  a  cut-off  level  S  of  around  5-7%  seems  to  give  the  best 
results.  However,  they  also  state  that  the  choice  of  this  cut-off  percentage  results 
from  an  examination  of  typical  sites  in  the  United  Kingdom,  but  that  there  are 
possibly  cases  for  which  different  cut-off  levels  should  be  adopted. 
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Table  1 

Matrix  E  created  with  hourly  mean  data  recorded  during  2010  at  two  anemometer 
stations  installed  in  the  Canary  Archipelago  (Spain)'1  (Fig.  2). 


Target  site  sectors 
(ITC) 

Reference  site  sectors  (Gran  Canaria  airport) 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

1 

297 

3 

19 

119 

474 

2 

3576 

233 
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9 

12 

22 

25 

85 

888 

3 

125 

251 

67 

22 

17 

4 

6 

4 

9 

5 

21 

46 

4 

66 

17 

31 

31 

26 

8 

3 

4 

2 

7 

13 

34 

5 

54 

10 

10 

13 

36 

30 

7 

8 

3 

7 

14 

28 

6 

63 

12 

6 

10 

18 

24 

9 

10 

2 

9 

20 

52 

7 

90 

8 

6 

5 

17 

44 

64 

18 

6 

8 

19 

40 

8 

131 

37 

33 

14 

17 

56 

257 

266 

99 

51 

54 

36 

9 

30 

5 

2 

2 

4 

5 

24 

43 

28 

30 

15 

10 

8 

1 

15 

17 

12 

5 

11 

2 

2 

6 

21 

11 

3 

12 

2 

11 

3 

1 

a  The  12  wind  direction  sectors  were  all  taken  of  the  same  size  (30°)  and  were 
measured  in  a  clockwise  direction.  Sector  1  is  defined  by  the  bounds  0(N)-30°  and 
sector  12  by  the  bounds  330-360°(N). 


Table  2 

Matrix  E'  created  from  the  frequencies  seen  in  matrix  E  (Table  1 ),  after  rejecting  the 
elements  that  comprise  a  fraction  less  than  or  equal  to  5%  of  the  total  number  of 
data  recorded  in  each  sector  of  the  target  site. 


Target  Reference  site  sectors  (Gran  Canaria  airport) 


site  sectors 


(ITC) 

1 
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8 

15 

17 

12 

5 

11 

6 

21 

11 

3 

12 

2 

11 

3 

1 

Table  3 

Matrix  Z  created  from  matrix  E'  (Table  2)  and  in  which  each  element  is  expressed 
as  a  percentage  of  the  total  number  of  data  of  each  target  site  sector. 


Target  Reference  site  sectors  (Gran  Canaria  airport) 
site 


(ITC) 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

1 

33.4 

13.4 

53.3 

2 

80.1 

19.9 

3 

25.6 

51.3 

13.7 

9.4 

4 

30.3 

7.8 

14.2 

14.2 

11.9 

6.0 

15.6 

5 

30.9 

7.4 

20.6 

17.1 

8.0 

16.0 

6 

33.3 

6.3 

9.5 

12.7 

10.6 

27.5 

7 

30.8 

5.8 

15.1 

21.9 

6.2 

6.5 

13.7 

8 

15.2 

6.5 

29.8 

30.8 

11.5 

6.3 

9 

17.6 

14.1 

25.3 

16.5 

17.6 

00 

00 

10 

14.0 

26.3 

29.8 

21.1 

00 

00 

11 

14.6 

51.2 

26.8 

7.3 

12 

11.8 

64.7 

17.6 

5.9 

and  target  site  (t)  which  pertain  to  the  direction  sector  j: 

?N  ^  r  a  m  \LT 


LT  _  2j=lZijWVj)r  +  a]\ .  ; 
{i)‘  “  100 


!=  1...N 


(55) 


In  the  other  approach,  it  is  only  assumed  that  the  mean  wind 
speeds  of  the  bins  of  a  sector  are  equivalent  to  the  mean  wind 


speed  of  that  sector.  So,  in  this  approach  the  mean  long-term  wind 
speeds,  (V;)”,  are  estimated  for  each  sector  i  of  the  target  site  from 
the  mean  long-term  wind  speeds,  (Vj)“,  of  sectors  j  of  the 
reference  station  through  a  first-order  linear  regression  obtained 
for  the  direction  sector  i,  Eq.  (56).  In  this  case,  the  mean  long-term 
wind  speeds  in  the  N  reference  site  direction  sectors  are  weighted. 
The  weighting  coefficients  are  composed  of  the  elements  zis  of  the 
matrix  Z.  pt  and  at  in  Eq.  (56)  are  the  slope  and  offset,  respectively, 
of  the  straight  line  that  characterises  the  relationship  between  all 
the  d'ij  (j=\...N)  short-term  wind  speed  data  of  the  reference  site 
(r)  and  the  target  site  (t)  which  pertain  to  the  direction  sector  i: 


<ytF=PtI*='zmr 


100 


i  =  l. ..N 


(56) 


The  authors  [101]  state  that,  in  general,  when  the  correlations 
were  good  they  found  little  difference  between  the  hindcasts 
made  with  the  two  approaches.  However,  when  the  correlations 
were  poor,  better  results  were  obtained  with  the  second  approach. 
The  authors  also  undertook  a  comparison  of  their  method  with  a 
standard  MCP  method  of  linear  regression  and  a  standard  MCP 
method  that  included  correction  for  changes  of  direction  in  a  pair 
of  sites  where  the  data  for  approximately  100  days  of  measure¬ 
ment  that  they  used  showed  influence  of  the  so-called  “channeling 
effect".  The  results  indicate  that  hindcasts  of  the  global  mean  of 
the  target  site  obtained  by  the  three  methods  were  almost 
identical  to  the  actual  measurement.  However,  the  method  pro¬ 
posed  by  the  authors  obtained  better  hindcasts  by  direction  sector 
than  the  other  two  methods  used  for  purposes  of  comparison. 

Vermuelen  et  al.  [162]  tested  the  applicability  of  the  method 
proposed  by  Woods  and  Watson  [101]  to  the  studies  they  carried 
out  on  wind  energy  sites  in  Armenia.  This  work  was  undertaken 
within  the  framework  of  an  Armenian-Dutch  project  called 
ArmNedWind,  the  purpose  of  which  was  to  study  the  large-scale 
integration  of  wind  energy  in  Armenia's  electricity  grid.  Measure¬ 
ments  were  taken  over  a  year-long  period  at  five  sites  in  Armenia. 
Monitoring  masts  50  m  agl  were  used  for  this  purpose  (with 
sensors  placed  at  35  m  agl).  In  reference  [162],  only  two  pairs  of 
sites  were  analysed:  Pushkin  Pass  (PP)-Karakhach  Pass  (KH)  and 
PP-Lake  Sevan  (SE).  KH  and  PP  are  situated  in  valleys  in  the  Bazum 
mountain  range  and  have  similar  topographic  characteristics.  SE  is 
found  on  the  shores  of  Lake  Sevan  (Ardanish).  In  this  study,  a 
matrix  of  16  direction  sectors  (N=16)  was  used  instead  of  the 
N=  12  direction  sectors  studied  in  [101].  In  addition,  in  this  study 
they  used  regression  lines  which  pass  through  the  origin  or,  in 
other  words,  regression  lines  with  zero  offset  value.  In  view  of  the 
results  that  were  obtained,  the  authors  concluded  that  the  matrix 
MCP  method  ofWoods  and  Watson  [101]  was  applicable  to  sites  in 
complex  mountainous  terrain  with  strong  wind  direction  changes. 
However,  they  state  that  though  the  reasoning  behind  the  propo¬ 
sal  of  Woods  and  Watson  [101  to  eliminate  all  bins  with  a 
frequency  of  occurrence  below  a  certain  cut-off  level  was  plau¬ 
sible,  its  real-life  application  introduces  a  number  of  problems 
with  solutions  that  are  only  arbitrary.  With  respect  to  the  cut-off 
level,  Vermuelen  et  al.  [162]  state  that  though  it  is  possible  that 
elements  (bins)  of  the  matrix  E  with  a  very  low  occurrence  have 
little  meaning,  the  actual  choice  of  ‘very  low’  level  is  completely 
arbitrary  and  could  easily  lead  to  the  elimination  of  infrequent  but 
significant  bins.  In  the  case  of  the  PP-KH  sites  that  were  analysed 
[162],  using  cut-off  levels  of  0.1%,  1%  and  5%  in  the  matrix  E  implies 
data  losses  of  6%,  31%  and  74%,  respectively.  Also,  elimination  of 
data  of  the  matrix  E  implies  that  the  mean  of  the  remaining  data 
contained  in  the  matrix  E’  must  be  adjusted.  This  adjustment  or 
compensation  is  carried  out  by  proportionally  increasing  the 
frequency  of  occurrence  of  the  remaining  data,  Eq.  (53).  However, 
the  authors  point  out  that  this  assumption  is  again  arbitrary,  as  the 
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eliminated  wind  speeds  are  not  known.  Nevertheless,  not  applying 
a  filter  will  result  in  a  certain  number  of  bins  with  no  meaning. 

Vermuelen  et  al.  [162],  to  avoid  the  use  of  bins  with  no 
meaning,  propose  checking  the  level  of  correlation  between  the 
bins  and  ignoring  the  entire  matrix  if  very  few  bins  show  a 
significant  correlation.  This  was  the  case  of  the  PP-SE  pair,  for 
which  the  data  were  related  in  just  one  portion  of  the  time  (30%) 
and,  consequently,  it  was  not  possible  to  convert  the  mean  short¬ 
term  wind  speed  of  SE  into  a  long-term  estimation  using  the 
data  of  PP. 


4.4.3.  Higher  than  first-order  linear  methods 

Though  MCP  methods  based  on  first-order  linear  regression  tools 
have  been  the  most  commonly  used  methods,  other  types  of  linear 
function  have  been  proposed.  In  the  literature  [55,106-107,130,132, 
163]  and  in  wind  industiy  software  [24,28,58,105],  regressions  have 
also  been  developed  using  higher  order  polynomials. 

Deane  et  al.  [163]  propose  a  method  that  they  call  the 
fragmented  interaction  regression  method  to  fill  the  gaps  in  the 
recorded  wind  speed  data  at  a  target  site.  Like  the  traditional  MCP 
linear  regression,  this  method  splits  the  wind  direction  into 
sectors  but,  instead  of  using  a  simple  linear  regression  between 
the  target  and  reference  sites  it  uses  a  multilinear  regression 
which,  in  addition  to  considering  the  wind  speed  and  direction, 
also  gives  a  value  to  the  product  of  both  variables.  So,  for  each 
direction  sector,  the  estimated  short-term  wind  speeds  of  the 
target  site,  (v,jfr,  are  given  by  the  following  equation: 

(Vj)f  =  P\  (Vj)f  +  /t2(dj)fT  +  MVj)?T(dj)srT  +  a+ej  (57) 

In  Eq.  (57),  (dj)^T  and  (Vj),T  are,  respectively,  the  observed 
reference  site  wind  directions  and  speeds  during  the  concurrent 
data  period,  pt  and  p2  are  the  linear  coefficients,  fl4  is  the 
coefficient  of  interaction  and  a  is  the  offset. 

The  authors  [163]  conclude  that  the  fragmented  interaction 
regression  is  an  improvement  over  the  linear  regression.  However, 
they  also  state  that  more  research  is  required  to  make  proper  use 
of  the  results  of  their  study. 

Joensen  et  al.  [132]  state  that  if  the  measurements  of  the  target 
and  reference  site  are  not  taken  at  the  same  height  agl,  then  a 
first-order  linear  model  is  not  a  reasonable  choice.  In  these  cases, 
they  propose  using  two  alternative  models  to  take  into  considera¬ 
tion  atmospheric  stability.  One  of  the  models,  the  following 
equation,  should  only  be  used  if  no  measure  of  turbulence 
intensity  is  available: 

(y/)fT  =/Mv j)rT  +/?5  (vj)rT  +  a  +  £j  (58) 

where  the  parameters  /?,,  p5  and  a  depend  on  the  reference  site 
wind  direction  sector,  9r.  /q  is  the  linear  coefficient,  /?5  the 
quadratic  coefficient,  a  the  offset  and  e  is  white  noise. 

It  is  proposed  that  the  other  model,  Eq.  (59),  be  used  when  data 
are  available  for  the  temperature  gradient  or  difference,  A Tr, 
between  the  temperatures  measured  at  two  heights  at  the 
reference  site: 

(V#  =  P\{Vjf?  +  MVj)rT(^Tj)rT  +  a  +  ej  (59) 

In  Eq.  (59),  /q  is  the  linear  coefficient,  p4  the  coefficient  of 
interaction,  a  the  offset  and  e  is  white  noise. 

Thogersen  et  al.  [105]  report  that  WindPro  software  has 
implemented  second-order  models  with  non-zero  offset: 

(vjf  =  pt  (Vj)f  +  ps(ffl  +  ej  (60) 

The  regression  parameters  are  calculated  with  the  least  squares 
method,  using  an  optimisation  algorithm  called  Amoeba,  which  is 
described  by  Press  et  al.  [75], 


Riedel  et  al.  [107]  also  proposed  using  a  second-order  poly¬ 
nomial  with  zero  offset,  Eq.  (60),  in  each  of  the  dynamically 
calculated  direction  sectors.  To  determine  the  parameters  /q  and 
/J5,  the  authors  proposed  a  novel  strategy.  The  algorithm  used 
attempts  to  minimise  the  weighted  sum  of  the  squares  of  the 
differences  between  the  histograms  of  the  measured  and  esti¬ 
mated  short-term  target  site  wind  speeds  using  a  chi-square  test. 

MINT  [28]  which  operates  with  statistical  and  mathematical 
software  packages  developed  for  scientific  and  commercial  ends, 
can  perform  regressions  using  first-,  second-  and  third-order 
polynomials,  Eq.  (61).  That  is,  by  using  a  linear,  quadratic  or  cubic 
fit.  If  the  method  has  to  include  a  zero  intercept,  this  software  also 
allows  to  select  this  option.  The  parameters  of  the  models  can  be 
estimated  by  means  of  ordinary  linear,  orthogonal  and  quantile 
regression: 

( VJ ) t T  =P\ (vj)rT  +  /*5 (vj)rT  +  Ps(vj)rT  +  a  +  £j  (61) 

McKenzie  et  al.  [130],  in  addition  to  second-  and  fifth-order 
polynomials,  considered  the  use  of  second-  and  third-order  Taylor 
polynomials.27  Rebbeck  [55]  analysed  models  based  on  fifth-order 
polynomials  and  cubic  splines.  The  author  reached  the  conclusion 
that  none  of  the  models  operated  substantially  better  than  a 
conventional  linear  regression  model  of  two  parameters. 


4.4.4.  Non-linear  methods 

Riedel  et  al.  [107],  with  respect  to  the  quadratic  term  of 
Eq.  (60),  assume  that  it  will  be  low  in  relation  to  the  linear  term. 
So,  they  tried  to  bound  the  quadratic  coefficient,  (j5,  of  Eq.  (60),  by 
taking  a  cut-off  wind  speed,  vcut_0ff,  off  10  m/s  and  a  maximum 
wind  speed,  vmax,  of  30  m/s.  So,  Eq.  (60)  is  replaced  with  the 
following  equation,  with  the  parameters  now  being  /q  and  ry. 


(w  =Pt(yjyl 


n  va(f  0jj  sin  (rj)  .vjT 


(yffr  +£j 


(62) 


According  to  Derrick  [17],  experience  shows  that  though  the 
relationships  between  the  wind  speeds  are  not  always  linear,  in 
most  cases  they  seem  to  fit  well.  However,  if  a  linear  fit  is  not 
suitable  a  power  regression  can  be  employed  using  a  function  of 
the  type  indicated  in  the  following  equation,  with  parameters 
>7  and  S: 

(v,)f  =  i/[(v,)ff  (63) 


In  this  case,  the  linear  regression  is  performed  after  taking 
logarithms  on  both  sides  of  the  following  equation: 

log[(Vj)f  ]  =  log(»/)  +  S  log[(Vj)fT]  (64) 


King  and  Hurley  [98]  also  evaluated  the  fit  of  a  power  law 
curve,  Eq.  (63),  but  despite  their  optimism  it  did  not  provide  any 
significant  improvement  over  the  first-order  linear  regression 
methods,  Eq.  (21).  The  method  of  fit  they  employed  was  the  one 
commonly  used  for  fitting  power  curves  to  data.  That  is,  they  took 
logarithms  to  both  data  series  and  then  applied  a  linear  regression 
of  least  squares  (on  the  line  that  starts  at  the  origin  and  passes 
through  the  centroid).  However,  as  it  takes  logarithms,  this 
method  gives  more  weight  to  the  set  of  lowest  wind  speeds, 
which  is  not  desirable.  The  authors  say  they  are  open  to  sugges¬ 
tions  as  to  other  methods  of  fit  of  the  power  curve  that  might  give 
better  results. 

McKenzie  et  al.  [130],  Clive  [133]  and  van  Lieshout  [135],  for  the 
development  of  the  methodology  they  propose,  take  as  their 
starting  point  the  hypothesis  that  wind  regimes  at  the  target 


27  It  should  be  mentioned  that  the  authors  do  not  strictly  use  the  Taylor 
Expansion  Series  as  a  regression  fit,  as  they  estimate  the  parameters  beforehand 
through  the  fit  of  a  Weibull  distribution  to  the  reference  and  target  site  data. 
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Fig.  16.  Flow  diagram  created  by  the  authors  to  facilitate  interpretation  of  the  block  of  tasks  of  the  algorithm  followed  by  the  discrete  method  of  Mortimer  [136], 


and  reference  sites  can  be  represented  by  Weibull  distributions 
[144.164].28  Likewise,  they  consider  that  if  a  monotonic  relation¬ 
ship  exists  between  the  target  and  reference  site  wind  speeds, 
then  the  values  of  their  cumulative  distribution  functions  (CDF) 
will  be  equal.  The  three  parameter  Weibull  distribution  function 
(K,  shape  parameter;  C,  scale  parameter  and  y,  location  parameter) 
can  be  expressed  in  closed  form  and  is  susceptible  to  linearisation. 
So,  making  the  cumulative  distribution  function  of  the  target  site 
equal  to  the  cumulative  distribution  function  of  the  reference  site 
leads  to  the  expression  shown  in  the  following  equation: 


C? 


KVj)f  =  Cf 


(vdf-rT 


1  Kr  /Kt 


r? 


(65) 


From  Eq.  (65)  the  authors  [130,133]  construct  Eq.  (66),  con¬ 
sidering  that  =  0  and  where  the  parameters  p  and  a  are  given 
by  Eq.  (67): 

(Vj)tT  =  £(vf)f  +  a  (66) 


28  It  should  be  mentioned  that,  though  the  two  parameter  Weibull  distribution 
has  been  widely  used  in  renewable  energy  related  literature  to  represent  wind 
regimes  [2,144,164],  in  several  references  [144,165-167]  sites  have  been  indicated 
where  the  wind  data  do  not  obey  a  two  parameter  Weibull  distribution  but  rather 
other  probability  density  functions. 


s=%-,P=C?{Cr‘) 


ST^S-  a  =  yf 


(67) 


Only  in  the  event  that  the  shape  parameters  of  both  distribu¬ 
tions  are  equal  (5=1),  will  Eq.  (66)  be  a  first-order  linear  function. 
So,  as  stated  by  Clive  [133],  an  error  is  being  made  if,  in  linear 
regression  based  MCP  methods,  the  shape  parameters  of  the 
Weibull  distribution  are  different  at  the  reference  and  target  sites. 

Sreevalsan  et  al.  [134]  present  and  discuss  an  MCP  method  based 
on  fast  Fourier  transforms  (FFT),  using  as  starting  point  a  previous 
study  undertaken  by  Flunt  and  Nason  [168],  Considering  the  differ¬ 
ence  in  amplitude  for  three  months  of  time  series  data  of  the  target 
and  reference  sites,  the  method  generates  a  one  year  time  series  of 
the  target  site  wind  speeds.  According  to  the  authors,  the  comparison 
made  between  the  simulated  and  measured  wind  data  showed  a 
good  correlation.  However,  they  state  that  it  remains  to  be  seen 
whether  this  procedure  can  be  adopted  for  data  synthesis  of  multiple 
years  with  small  deviations.  They  also  comment  that  their  study  is 
only  a  preliminary  one  and  that  more  data  series  from  other  sites 
need  to  be  analysed  before  general  validation  of  the  results. 

According  to  Hunt  and  Nason  [168],  wavelet  methods  have 
been  shown  to  provide  more  reliable  estimations  than  prediction 
methods  based  on  a  simple  linear  regression.  Moreover,  they 
report  that  their  models  have  the  added  benefit  of  often  being 
physically  interpretable.  However,  the  computational  complexity 
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of  the  model  is  very  high  when,  for  example,  one  year's  worth  of 
data  is  used. 

4.4.5.  Probabilistic  methods 

Various  references  have  proposed  the  use  of  MCP  methods 
based  on  probabilistic  methods  [125,130,133,136-143]. 

Mortimer  [136]  proposes  a  method  he  calls  the  discrete 
method,  in  which,  in  the  concurrent  data  period,  the  data  of  both 
sites  are  distributed  in  a  matrix  B,  in  which  an  element,  by, 
represents  the  wind  speed  bin  i  (with  bin  width  of  1  m/s)  and 
wind  direction  sector  j  of  the  reference  site,  Fig.  16.  So,  this  method 
is  based  on  the  hypothesis  that  the  target  and  reference  site  wind 
directions  coincide.  Each  wind  speed  bin  i  of  the  reference  site 
wind  direction  bin  j  contains  the  set  of  ratios  calculated  between 
each  target  site  wind  speed,  (v^f7,  which  is  paired  by  date  and 
time  k  with  the  corresponding  reference  site  wind  speed,  (v^.);!7. 
Two  matrices  are  generated  from  the  matrix  B:  one,  AR,  in  which 
each  element,  ar,,,  represents  the  average  ratio  corresponding  to 
wind  speed  bin  i  of  the  reference  site  wind  direction  sector  j, 


Eq.  (68),  and  another,  SD,  in  which  each  element,  sdy,  represents 
the  standard  deviation  of  the  ratios  of  wind  speed  bin  i  and  wind 
direction  sector  j  of  the  reference  site,  Eq.  (69): 

anj  =  1  £  (bijh  (68) 

nij  k  =  1 

sdij  =  1  £  [(bylk-ary]2  (69) 

nij~  1  k  =  1 

where  n,j  is  the  number  of  ratios  stored  in  the  element  by  of 
matrix  B. 

Using  the  AR  matrix  of  the  means  of  the  ratios  and  the  SD 
matrix  of  the  standard  deviations  of  the  ratios,  the  n  wind  speeds 
of  the  long-term  target  site  wind  speed  series  are  estimated  as  a 
function  of  the  n  data  of  the  long-term  reference  site  wind  speed 
(vk)7T  and  direction  (dk)7T  series,  through  the  following  equation: 

(Vfe)tT  =  [arij  +  £ij](vk)rT;  (vfe)rT, (dk)rT(=bij;  k=  l...n  (70) 


(^2^)  Storing  data  in  appropriate  cell  i,j. 

Determine  the  appropriate  bin  j  and  sector  i. 
(^5^)  Calculate  the  long-term  wind  speed. 
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Fig.  17.  Flow  diagram  created  by  the  authors  to  facilitate  interpretation  of  the  block  of  tasks  of  the  algorithm  followed  by  the  matrix  method  [24,105]  when  using  the 
probability  distribution  associated  with  the  variables  Ay  and  Ad  measured  during  the  concurrent  data  period. 
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Storing  data  in  appropriate  cell  i  j  (^3^  Determine  the  appropriate  bin  j  and  sector  i 
(^6^)  Calculate  the  long-term  wind  speed 


Fig.  18.  Flow  diagram  created  by  the  authors  to  facilitate  interpretation  of  the  block  of  tasks  of  the  algorithm  followed  by  the  matrix  method  [24,105]  when  using  polynomial 
functions  together  with  a  bivariate  Gaussian  distribution. 


where  ey  is  a  random  variable  with  symmetric  triangular  distribu¬ 
tion  [169]29  used  with  the  idea  of  taking  into  consideration  the 
corresponding  standard  deviation,  sdy,  of  the  ratios  or  their 
variability.  Given  a  random  value  U  extracted  from  a  uniform 
distribution  in  the  interval  (0, 1),  then  the  random  variable,  ey,  is 
given  by  the  following  equation: 

0<U  <1/2  =>  £ij  =  (2VW-V6)sdij 
1/2<U<1  =>  Eij  =  (V6^2 \/3(\—U))sdij 

Mortimer  [136],  as  a  result  of  tests  carried  out,  concludes  that 
the  discrete  method  can  predict  extreme  wind  speeds  better  than 
first-order  linear  regression  methods. 


29  The  triangular  distribution  is  used  to  describe  a  variable  which  is  limited  to  a 
particular  range  but  which  very  probably  has  values  in  the  middle  of  that  range. 


Sheppard  [56]  modified  Mortimer's  method  [136]  by  replacing 
the  triangular  distribution  with  a  normal  distribution.  The  direc¬ 
tion  sectors  used  by  Sheppard  [56]  were  one  of  the  following: 
22.5°,  451  90°,  or  180°.  In  the  procedure,  the  smallest  possible 
wind  direction  sector  size  is  chosen  with  the  restriction  that  there 
are  at  least  four  observations  in  each  bin. 

Rogers  et  al.  [106]  used  the  method  proposed  by  Mortimer  [136] 
for  purposes  of  comparison  with  three  other  MCP  methods.  In  the 
algorithm  that  was  implemented,  the  following  actions  were  pro¬ 
grammed:  (a)  if  the  reference  site  wind  speed  is  below  1  m/s,  the 
wind  speed  ratio  takes  the  value  1,  (b)  if  insufficient  data  for  a 
particular  element,  by,  of  the  matrix  B  are  available  to  calculate  an 
average  ratio,  then  the  ratio  is  determined  as  the  ratio  of  the  mean  of 
all  the  wind  speeds  of  the  direction  sector  j  and  the  standard 
deviation  is  taken  as  zero,  (c)  if  Eq.  (67)  predicts  negative  long-term 
target  site  wind  speeds,  these  wind  speeds  take  zero  value. 
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According  to  Thogersen  et  al.  [24,105],  one  of  the  MCP  methods 
available  in  WindPro,  called  the  matrix  method,  calculates  in  the 
concurrent  data  period  the  differences  in  wind  speed  (speed-up), 
Eq.  (72),  and  the  differences  in  wind  direction  (wind  veer), 
Eq.  (73),  of  the  target  and  reference  sites,  sorting  the  results  by 
reference  site  wind  speed,  (vk);?T,  and  wind  direction,  (dfc )^r.  Figs.  17 
and  18.  Results  are  stored  in  a  matrix  G,  where  each  element  or 
cell,  gij,  corresponds  to  a  wind  speed  bin,  fa„  (with  default  width  of 
1  m/s)  and  a  wind  direction  sector, (Sj)r,  (default  value  of  30°)  of  the 
reference  site: 

Avk  =  (Vfc)f -(vfc)f  (72) 


A  dk  =  (dk)fr-(dk)fT  (73) 

Since  some  elements,  gy,  of  the  matrix  might  end  up  empty, 
the  algorithm  implemented  can  interpolate  or  extrapolate  statis¬ 
tical  properties  from  the  other  elements  or  cells  with  a  sufficient 
number  of  data  in  order  to  fill  in  these  gaps  using  for  this  purpose 
polynomial  fits,  Eq.  (74).  The  properties  estimated  in  each  of  the 
empty  cells  of  the  matrix  are  the  means  {fiAv,fiAd)  and  standard 
deviations  ( sAv,sAd )  of  the  variables,  Ay  and  Ad,  as  well  as  the 
correlation  coefficient,  rAvAd,  between  them: 

NP 

SPij  =  a+  2  Pkk[(v ’j)rT]kk+ei  (74) 

kk=  1 


In  Eq.  (74),  SPy  is  the  estimated  statistical  property  in  cell  gy, 
the  parameters  a  and  p  of  the  models  depend  on  the  wind 
direction  sector  i  under  consideration  and  on  the  estimated 
statistical  property  and  et  is  the  residual  term  [2],  The  implemen¬ 
ted  algorithm  allows  to  select  the  order  of  the  polynomial  (NP), 
and  this  order  can  be  different  for  each  statistical  property.  NP=1 
is  the  predetermined  option  for  estimating  the  mean  wind  speed¬ 
up.  However,  to  estimate  the  mean  wind  veer,  the  predetermined 
option  considers  that  these  changes  are  independent  of  wind 
speed  and  therefore  proposes  a  zero-order  polynomial.  For  stan¬ 
dard  deviations,  the  recommendation  is  to  use  NP=1  for  wind 
direction  changes  and  NP=2  for  wind  speed  changes. 

To  estimate  the  long-term  target  site  wind  speeds,  (vk)[T,  the 
implemented  algorithm  admits  two  options.  One  of  them,  Fig.  17, 
consists  of  estimating  the  difference  in  wind  speeds  and  the 
difference  in  wind  directions  as  a  function  of  the  reference  site 
wind  speed  and  direction  through  the  use  of  the  probability 
distribution  associated  with  the  variables,  Av  and  Ad,  measured 
during  the  concurrent  period.  This  option  is  the  one  recommended 
if  the  number  of  samples  in  the  relevant  matrix  cell  is  at  least  five. 
The  other  possibility,  Fig.  18,  lies  in  using  the  polynomial  functions, 
Eq.  (74),  together  with  a  bivariate  Gaussian  distribution,  Eq.  (75)  [71], 
of  the  speed-up  and  wind  veer: 


Pd/(  Av,  Ad)  =  Aexp 

((Av~Pav)2AL)— 2rAvAd((Av— ;iAVMAd— ^Ad)/sAvsAd)  +  ((Ad—  pAd)2/s\d) 

2V/l-rA,Ad 

(75) 


where  A  is  given  by  the  following  equation: 
A=  1 


2.7TS AyS Ad  \/l  r AvAd 


(76) 


Starting  with  the  long-term  reference  site  data  time  series,  the 
long-term  target  site  wind  speed  and  direction  time  series  is 
estimated  through  the  following  procedure.  First,  each  pair  of  data 
of  the  series  [(vfe)J:T,  (dfc)J:r]  indicates  the  cell,  gy,  of  the  matrix  G 
that  has  to  be  used,  Figs.  17  and  18.  Second,  if  the  option  is  chosen 
of  using  the  data  measured  during  the  concurrent  period  (Fig.  17), 
then  from  the  data  contained  in  the  above  indicated  cell,  gy,  a  pair 
of  values  (Av1T,A dLT)  are  selected  by  bootstrapping  [170],  which 


are  considered  to  be  long-term  representative.  If  the  second  option 
is  selected  (Fig.  18),  then  the  corresponding  pair  of  values 
(Avlr,A dLT)  are  randomly  generated  from  the  bivariate  Gaussian 
distribution  associated  with  the  above-indicated  cell,  gy,  through 
the  Monte  Carlo  simulation  method  [147],  Subsequently,  the  long¬ 
term  target  site  wind  speed  and  direction  are  estimated  using 
Eqs.  (77)  and  (78),  respectively. 


(Vk)f  =  A^  +  (Vk)f 

(77) 

(dk)f  =  AdLT  +  (dk)? 

(78) 

Lambert  and  Grue  [142]  report  that  common  matrix  methods 
often  perform  well  when  compared  with  other  methods,  but  they 
are  normally  used  to  generate  frequency  distributions  rather  than 
data  time  series.  So,  the  authors  propose  a  method  they  call  the 
Matrix  Time  Series  method  (MTS),  the  algorithm  of  which  is 
available  in  the  MCP  module  of  the  WindoGrapher  software  [25] 
and  which  consists  of  an  adaptation  of  the  classic  matrix  method 
to  enable  estimation  of  long-term  target  site  data  time  series.  The 
procedure  involves  four  basic  steps.  First,  the  reference  and  target 
site  wind  speeds  of  the  concurrent  data  period  are  distributed  in 
bins  defined  by  the  reference  site  wind  direction  and  a  matrix  U  is 
constructed  for  each  reference  site  wind  direction  sector.  Each 
matrix  U  constitutes  a  joint  probability  distribution  representing 
each  sector  and  period.  Each  cell,  Uy,  of  each  matrix  U  contains  the 
number  of  times  that,  during  the  concurrent  data  period,  target 
site  wind  speeds  of  bin  i  and  reference  site  wind  speeds  of  bin  j 
were  recorded.  So,  in  each  column  j  of  a  matrix  U  the  probability 
mass  function,  PMFj,  is  defined  of  the  target  site  wind  speed 
conditioned  on  the  reference  site  wind  speed  of  bin  j.  From  each 
PMFj  it  is  possible  to  construct  the  cumulative  probability  mass 
function,  CPMFj,  of  the  target  site  wind  speed  for  the  reference  site 
bin  j. 

In  the  second  step,  a  time  series  of  percentiles1  is  constructed 
for  the  target  site.  For  this,  each  pair  of  data,  (vk)srT  and  (<4)^,  of  the 
reference  site  time  series  of  wind  speeds  and  directions  allows  to 
select  the  appropriate  direction  sector  and  appropriate  bin  j 
(column)  of  the  matrix  U  of  that  sector,  while  the  corresponding 
data,  (Vj;)^,  of  the  target  site  time  series  of  wind  speeds  indicates 
the  corresponding  percentile  in  the  CPMFj,  Fig.  19. 

For  the  third  step,  the  algorithm  synthesises  a  long-term  time 
series  of  percentiles  for  the  target  site.  For  this,  a  first-order 
transition  probability  matrix  of  a  Markov  chain  is  constructed 
[171], 31  which  stores  the  frequency  with  which  the  percentiles 
change  from  one  state  to  another.  By  generating  random  numbers, 
artificial  scenarios  are  created  to  fill  in  the  missing  data  period  of 
the  target  site  time  series.  If  the  final  synthetic  percentile  (at  the 
end  of  the  unknown  time  series)  does  not  adequately  coincide 
with  the  value  of  the  first  percentile  of  the  measured  time  series, 
then  the  algorithm  discards  the  simulated  scenario  and  proceeds 
to  construct  others  until  a  reasonable  match  is  obtained  between 
the  final  datum  of  the  synthesised  series  and  the  first  datum  of  the 
measured  series.  The  algorithm  has  the  option  of  smoothing  out 
the  time  series  of  percentiles  between  the  second  and  third  steps 
using  the  technique  of  moving  averages  [67],  The  authors  [142] 
have  verified  that  this  option  improves  the  accuracy  of  the  results 
in  many  cases. 

The  fourth  and  final  step  attempts  to  transform  the  time  series 
of  synthetic  percentiles  into  synthetic  target  site  wind  speeds, 
Fig.  20.  Essentially,  the  procedure  consists  of  reversing  the  second 


30  They  are  the  99  values  that  split  the  distribution  into  100  equal  parts.  The 
percentile  is  a  measure  of  non-central  position  that  tells  us  how  a  value  is 
positioned  with  respect  to  all  the  values  of  the  sample. 

31  First-  and  second-order  Markov  chains  have  been  used  in  renewable  energy 
related  literature  to  model  wind  data  [172-173]. 


J.A.  Carta  et  al.  /  Renewable  and  Sustainable  Energy  Reviews  27  (2013)  362-400 


391 


(4- 


~  ..  ST:  Short-Term 

Wind  direction  at  the  reference  site 

- 

kW  w 

1  1  1 

VjfcT  ’ 

j  Lai 

N 

sector^ 


(V), 


(V), 


100 


Wind  speed  at  the  reference  site 

ST 

- 

~CD — 


Time 


Wind  speed  at  the  target  site 

ST 

- 

l 

Ji  r  m_ i_ i_ i 

i 

•it  ™ 

i 

Percentile  at  the  target  site 


ST 
n - r 


Time 


Time 


Wind  speed  bins  (reference) 


(v*r 


•© 


Nb 


uuj 

Nb 


>  T3 


(vj 


& 


Wind  speed  bins  (target) 

o 


$ 

(’['Mi'  :  Cumulative  probability  mass  function. 

(^7^|  Create  the  percentile  time  series. 

(^6^j  Determine  the  appropriate  percentile.  \  100 

(^5^)  Determine  the  appropriate  target  speed  bin  (i).  \ 

fj)  Construct  the  cumulative  probability  distribution  function.  "CjD 
[^4]  Determine  the  appropriate  reference  speed  bin  (j).  g 

Construct  the  joint  mass  probability  distribution. 

0  Determine  the  appropriate  reference  direction  sector.  ^ 

Fig.  19.  Flow  diagram  created  by  the  authors  to  show  the  first  tasks  of  the  algorithm  followed  by  the  MST  method. 


step.  In  this  case,  each  pair  of  data,  (vk)^T  and  (dfc)J:r,  of  the  long¬ 
term  reference  site  time  series  of  wind  speed  and  direction  allows 
to  select  the  appropriate  direction  sector  and,  in  that  sector,  the 
appropriate  associated  column  (bin  j )  of  the  matrix  U.  The 
synthetic  percentile  k  allows  to  calculate  in  the  CPMFj  of  the  bin 
j  of  that  matrix  the  synthetic  wind  speed,  (vk)[T,  that  corresponds 
to  the  target  site  data  k. 

Salmon  and  Walmsley  [137]  slightly  modified  a  method  first 
developed  by  Walmsley  and  Bagg  [138],  which  allows  that  the 
reference  site  wind  rose  can  be  different  to  the  target  site  wind 
rose.  Initially,  in  the  proposed  methodology,  a  matrix  H 
(3072  x  3072)  is  created  which  constitutes  a  joint  relative  fre¬ 
quency  distribution  of  short-term  wind  speed,  wind  direction  and 
atmospheric  stability.  That  is,  each  element,  hff,  of  the  matrix 
represents  the  short-term  relative  frequency  of  occurrence  of 
category  i  at  the  reference  site  and  of  category  j  at  the  target  site. 
The  number  of  categories,  Nil,  is  obtained  after  classifying  the 
short-term  data  of  both  stations  by  wind  direction  (16  equal  sized 
sectors),  wind  speed  (32  bins)  and  atmospheric  stability  (six  bins). 

The  frequencies  of  occurrence  of  category  i  at  the  reference  site 
and  of  category  j  at  the  target  site  are  estimated  from  the  following 


equation: 


ST 


„  Nh  Nh 

(Fi)V=  Z  ;  (Fj)t  =  Z  htJ 

i  =  i  i  =  i 


(79) 


The  method  calculates  a  matrix  C  of  coefficients  Cy,  Eq.  (80), 
which,  assuming  they  remain  constant  over  the  long-term,  are 
used  to  estimate  the  long-term  target  site  relative  frequencies, 

Eq.  (81): 


(Ftfr 


(80) 


(FjF=  Z  CtJ<Fi)rT  (81) 

i=  1 

Garcla-Rojo  [139]  also  uses  a  procedure  based  on  calculation  of 
the  joint  probability  mass  function  pfLr(vt,d[,vr,dr)  of  the  short¬ 
term  wind  speed  ( v )  and  direction  (d)  of  the  target  and  reference 
sites.  Starting  with  this  short-term  joint  probability  mass  function 
and  the  probability  mass  function  of  long-term  wind  speed  and 
direction  at  the  reference  site,  pL^(v,d),  the  probability  mass 
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Fig.  20.  Flow  diagram  created  by  the  authors  to  show  the  last  tasks  of  the  algorithm  followed  by  the  MST  method. 


function  of  long-term  wind  speed  and  direction  at  the  target  site, 
p[T(v,d),  is  estimated: 

Nw  Nd 

PtT(Vi,dj)=  jr  2  p^r(vildj,vk,dz)p^T(Vil,dz)  (82) 

k  =  lz= 1 


Nw  and  Nd  are  the  numbers  of  bins  of  wind  speed  and 
direction,  respectively,  of  the  joint  probability  mass  function.  In 
this  study,  twelve  30°  bins  of  wind  direction  were  used  to  define 
the  target  and  reference  site  wind  direction.  The  widths  of  the 
wind  speed  bins  were  1  m/s. 

Carta  and  Velazquez  [141]  proposed  a  new  MCP  method,  the 
algorithm  of  which  uses  the  target  site  wind  speed  probability 
distribution  function  conditioned  on  the  reference  site  wind 
speed.  Contingency-type  bivariate  distributions  with  specified 
marginal  distributions  were  used  for  this  purpose: 


/f(Vt|vr)  = 


2  VS2' 


(yr-l)[Sl-2y/Ff(vt)][Sl-2y/Ff(vr)] 

S2 


+  W+  1 


(83) 


where  SI  is  given  by  Eq.  (84)  and  S2  is  given  by  Eq.  (85): 

51  =  1  +  [Ff  (vt)  +  FfT(vr)](Vz— 1 )  (84) 

52  =  S12— 4vr(vr— l)Ff(vt)Ff(Vr)  (85) 

In  the  above  equations,  FstT(vt )  and  F^T(vr)  are  the  cumulative 
distribution  functions  of  the  wind  speed  at  the  target  and 
reference  sites,  respectively.  f^T(vt)  is  the  wind  speed  probability 
density  function  of  the  target  site,  y  is  the  coefficient  of  association 
which  needs  to  be  estimated.32  To  estimate  y/,  the  authors  equate 
the  values  of  the  correlation  coefficient  of  the  sample  and 
the  model. 

The  marginal  distributions  used  were  two-component  mixture 
Weibull  probability  density  functions  [144,165-166],  which 
depend  on  five  parameters. 


32  This  bivariate  distribution  has  only  one  parameter,  in  addition  to  the 
marginals. 
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In  the  proposed  method,  from  the  short-term  conditional 
probability  density  function,  Eq.  (83),  and  the  long-term  reference 
site  wind  speed  probability  density  function,  fj(v),  the  long-term 
target  site  wind  speed  probability  density  function,  fj(v),  is 
estimated: 

1  f°° 

rtT{Vt)=  CNF  Jo  (Vt\yr)frT (vr)dVr  (86) 

where  CNF  is  a  normalisation  factor: 

[•  OO  r-  OO 

CNF=  /  Jf  (vf|vr)/‘r(vr)dvr  dvt  (87) 

The  stratified  fc-fold  cross-validation  method  with  k  partitions 
was  used  to  train  and  test  the  model  [174],  Using  data  series 
recorded  in  the  Canary  Islands  (Spain),  the  authors  concluded  that 
this  method  gave  better  results  in  most  cases  than  those  obtained 
with  the  variance  ratio  method  [106],  Weibull  scale  method 
[130,133]  and  a  modification  of  the  method  developed  in  [139], 
which  they  used  for  comparison  purposes. 

The  modification  of  the  method  described  in  [139]  consisted  of 
using  the  probability  density  function  of  wind  speed  and  direction 
at  the  target  site  conditioned  on  the  wind  speed  and  direction  at 
the  reference  site,  pJ(Vj,dj\vk,dz),  Eq.  (88),  instead  of  the  joint 
probability  mass  function  of  wind  speed  and  direction, 
PtLr(yt,dt,vr,  dr)3:!: 

1  Nw  Nd 

p[T(vudj)=  Y,  I  pf(Vi,dj|vk,dz)pf(vfc,dz)  (88) 

Livr  k  =  lz  =  1 

where  CNF  is  a  normalisation  factor: 


Nw  Nd  Nw  Nd 

CNF=  £  i  I  i  PcT(vo  dj\vk,  dz)pj(vk,  dz)  (89) 

!=lj=lfc=lZ=l 

Romo  et  al.  [125]  also  propose  models  based  on  conditional 
probability  density  functions.  They  have  presented  two  models, 
one  based  on  a  bivariate  normal  distribution  [143,175]  and  the 
other  based  on  a  bivariate  Weibull  distribution  function,  Eq.  (90), 
whose  marginal  distributions  are  two-parameter  Weibull  distribu¬ 
tions: 


F(vt,vr)=  l-exp<  - 


(90) 


where  X  is  a  parameter  which  controls  the  degree  of  association 
between  the  two  variables.  The  five  parameters  of  the  bivariate 
Weibull  distribution  are  estimated  using  the  maximum  likelihood 
method. 

In  their  work,  the  authors  generate  a  time  series  of  synthetic 
data  with  a  controlled  correlation  structure  and  with  the  desired 
parameters  of  the  distribution  laws  to  evaluate  the  different  MCP 
methods  under  consideration.  Among  their  conclusions,  they 
report  that  only  one  of  the  five  models  tested  that  based  on  the 
bivariate  Weibull  conditional  probability  function  was  able  to 
accurately  predict  all  five  of  the  performance  metrics  under 
consideration. 

Some  authors  [130,133]  have  proposed  the  use  of  an  empirical 
method  known  as  the  Weibull  scale  method,  based  on  parameter 
relationships  of  the  two-parameter  Weibull  distribution  law.  This 
method  takes  as  its  starting  point  the  assumption  that  the 
reference  and  target  site  wind  speeds  follow  a  two-parameter 
Weibull  distribution.  Additionally,  the  existence  is  considered  of  a 
linear  relationship  between,  on  the  one  hand,  the  scale  parameters 
(C),  and,  on  the  other,  the  shape  parameters  ( K )  of  the  probability 
density  functions  of  the  target  and  reference  sites,  Eq.  (91). 


33  It  should  be  mentioned  that  this  fact  was  not  correctly  explained  in  [  141  ]. 


So,  essentially  it  is  more  a  linear  than  probabilistic  method: 


£ 

if 


1  jLJ  _  "C  1L 

t  —  ,STAr 


(91) 


In  Eq.  (91),  X  represents  the  scale  or  shape  parameter  under 
consideration. 

This  method  has  been  implemented  in  various  wind  industry 
software  applications  [24,25]  and  the  Weibull  parameters  are 
calculated  for  each  of  the  direction  sectors  in  which  the  reference 
and  target  site  wind  data  are  distributed. 

According  to  Thogersen  et  al.  [24,105],  one  advantage  of  this 
method  is  its  ability  to  adapt  to  the  nature  of  the  wind  at  most 
sites,  but  care  should  be  taken  when  applying  this  method  at  sites 
where  the  Weibull  distributions  are  not  representative  of  their 
wind  regimes.  When  considering  frequencies,  the  modified  long¬ 
term  distribution  must  be  normalized  for  the  N  direction  sectors, 
0r,  under  consideration: 


f?(f>  r)  = 


ftT(#r)  fLT(n 

fMr  ( 


y  ft  (°r)  fLT(f) 
L  fST,,.  Jr  Wr) 

0=1  fr  (dr) 


(92) 


4.5.  Estimation  methods  of  long-term  wind  characteristics  using 
multiple  reference  stations 

Though  most  of  the  MCP  methods  used  to  date  in  the  wind 
industry  have  employed  a  single  reference  station,  methods  have 
been  proposed  in  the  literature  based  on  the  use  of  various 
reference  stations  to  estimate  the  wind  characteristics  at  a 
target  site. 

Probst  and  Cardenas  [61  ]  are  of  the  opinion  that  it  may  often  be 
advisable  to  consider  various  reference  stations  with  simulta¬ 
neously  measured  datasets  to  estimate  the  long-term  wind  char¬ 
acteristics  of  a  target  site.  They  state  that  a  multiple  linear 
regression  is  often  assumed  [66-67]  between  the  wind  speed 
variables.  According  to  Brower  [9],  sometimes  the  different  refer¬ 
ence  stations  capture  different  aspects  of  the  wind  climate  at  the 
target  site  and  a  multiple  linear  regression  can  be  a  practical  way 
of  improving  the  general  correlation,  as  the  weights  given  to  each 
reference  station  in  the  fit  depend  on  the  correlation  of  each 
station  with  the  target  site  station  and  its  statistical  independence 
from  the  other  stations.  However,  Brower  [9]  warns  that  there  is  a 
risk  of  obtaining  poor  results  as  a  consequence  of  an  over-specified 
fit  if  the  number  of  stations  used  is  too  high  and  they  are  strongly 
correlated. 

Walls  et  al.  [176]  undertook  a  study  which,  for  estimation  of  the 
long-term  target  site  wind  speed,  was  based  on  information 
provided  by  two  reference  stations  (rl,  r2)  located  18  km  and 
29  km  from  the  target  site  (t),  and  on  the  use  of  what  they  term 
planar  regression: 

(Vi)tT  =  P  i(vf)rt  +  p2(Vi)rl  +  a  +  £i  (93) 

The  authors  reported  that  the  mean  absolute  error  and  the 
standard  deviation  of  errors  fell  when  planar  regression  was  used 
compared  to  the  results  obtained  when  using  only  one  of  the  two 
reference  stations.  However,  planar  regression  did  provide  a  small 
improvement  in  the  accuracy  of  the  estimation  based  on  short¬ 
term  data  periods. 

Oh  et  al.  [177]  propose  a  strategy,  which  they  call  the  com¬ 
plementary  MCP  technique,  to  reconstruct  wind  data  time  series 
from  stations  with  missing  data  periods  that  prevent  having 
continuous  long-term  data  series  of  sufficient  length  for  proper 
analysis.  The  proposed  strategy  requires  various  nearby  stations 
that  can  provide  the  information  needed  to  fill  in  the  missing  data 
period/s.  Their  proposal  does  not  involve  regression  models  which 
simultaneously  use  information  from  various  stations,  but  rather 
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simple  linear  regression  techniques  between  two  stations  to 
restore  the  missing  wind  speeds  and  directions  using  the  station 
with  the  higher  coefficient  of  determination,  R2,  as  the  reference 
station. 

Casella  [178],  based  on  the  argument  that  when  various 
reference  stations  are  available  it  may  be  preferable  to  perform  a 
multiple-correlation  analysis  to  improve  the  accuracy  of  the  target 
site  estimation,  proposes  a  new  probabilistic  MCP  method,  named 
the  J-tris  method.  It  constitutes  an  extension  of  the  method 
published  by  Garcia-Rojo  [139],  but  which  in  this  case  uses  a  joint 
probability  function  constructed  from  the  simultaneous  measure¬ 
ments  recorded  at  three  weather  stations.  Here,  the  data  of  the 
short-term  period  of  the  target  station  are  corrected  by  the  long¬ 
term  data  from  the  two  reference  stations. 

In  the  1980s,  Barros  and  Estevan  [116]  proposed  using  a 
method,  which  they  called  the  Multiple  Climatic  Reduction  Tech¬ 
nique  (MCRT),  to  estimate  the  mean  annual  wind  speed  and  its 
yearly  variation  at  a  target  site  using  just  two  or  three  months' 
worth  of  wind  speed  measurements  for  that  site.  In  the  procedure 
followed  by  the  authors,  if  V(L  x  M)  is  a  matrix  whose  columns  j 
are  the  time  series  with  L  data  from  M  weather  stations,  it  is 
possible  to  represent  the  elements  vtj  through  the  following 
equation: 

Vy=  S  aikwk j;  i=  1...L;  j=\...M  (94) 

k=  1 

where  aik  are  the  elements  of  an  orthogonal  matrix  A  (Lx  I) 
formed  by  the  eigenvectors  of  the  product  of  matrix  V(lxM)  and 
its  transposed  matrix  VT  (JW  x  L),  and  wkJ-  are  the  elements  of  a 
matrix  W  (Lx  M)  which  correspond  to  projections  of  the  data  from 
each  station  onto  each  row  of  A  (L  x  L). 

Like  all  MCP  methods,  this  one  is  based  on  the  assumption  that 
the  wind  speeds  have  some  degree  of  spatial  correlation  and, 
therefore,  there  exists  some  sort  of  dependence.  Based  on  this,  it  is 
assumed  in  the  procedure  that  the  elements,  Vy,  can  be  repre¬ 
sented  with  reasonable  accuracy  through  the  following  equation: 

Na 

Vij  =  X  ai,kWkj;  i=l...L;  j=  1...M  (95) 

k=  1 

where  Na  <  minimum(L,M}  is  the  number  of  eigenvectors  that 
must  be  chosen  so  that  Eq.  (95)  conserves  a  significant  percentage 
of  the  variance  expressed  by  the  trace  of  WT. 

If  the  data  of  the  target  station  0  =  0  are  incomplete  and  only  a 
certain  length  is  available  of  the  time  series  i  =  T1...T2,  with  T1>1, 
T2<L  and  T2-T1  +  1  <  Na,  the  values,  vkt,  can  be  expressed  by 
Eq.  (94),  even  though  they  have  not  been  used  to  calculate  the 
elements  of  the  matrix  A. 

The  method  proposed  is  based  on  minimisation  of  the  mean 
square  errors,  Eq.  (96),  which  are  produced  between  the  observa¬ 
tions  made  at  the  target  station  0=0  and  their  representation  by 
means  of  empirical  orthogonal  functions,  Eq.  (95): 

i  =  T2  i  =  T2  j  Na  \  2 

I  £u  =  I  vu-  £  aikwk[  (96) 

i  =  T1  i  =  T1  \  k=  1  / 

Minimisation  of  Eq.  (96)  leads  to  a  linear  system  of  Na 
equations  and  Na  unknowns  (wk  t).  Once  the  unknowns  have  been 
found,  the  unknown  wind  speeds  of  the  target  station  can  be 
estimated  by  applying  the  following  equation: 

Na 

Vi,t=  z  aikwkt;  i=  1...7T-1  a  i  =  T2  +  1...L  (97) 

k  =  1 

In  1988,  Barros  and  Schmidt  [179]  described  a  method  to  find 
the  minimum  span  of  data  required  to  extend  a  climatic  data 
series  to  a  specified  longer  period  using  the  MCRT. 


Zaphiropoulos  et  al.  [180]  propose  a  method  for  wind  speed 
and  direction  estimation  using  only  a  short-term  data  period  from 
a  target  site  and  long-term  data  records  from  various  nearby 
weather  stations.  The  technique  is  based  on  the  use  of  statistical 
analysis  and  the  inclusion  of  lodging  methodology  based  on  the 
works  of  Haslett  and  Raftery  [181],  In  [180],  the  authors  expand 
upon  an  earlier  work  of  theirs  [182]  by  using  10-min  averaged 
instead  of  day-averaged  wind  data  as  well  as  by  estimating  the 
wind  speed  in  16  directions. 

Briefly,  the  method  involves  using  a  matrix  V  (JW  x  L)  whose 
rows  i  are  the  time  series  of  JW  weather  stations  whose  data  L  were 
recorded  over  a  long-term  period,  except  for  one  (i=t)  where  only 
a  short-term  period  of  measured  data  is  available  j  =  Tl...T2,  with 
T1>1  and  T2<L.  In  the  proposed  method,  the  general  least  square 
estimator  of  the  mean  of  the  square  roots  of  the  target  site  wind 
speeds  4  is  given  by  the  following  equation: 


—,LT  Qi[VST-VL 
f  <Jt,t 


(98) 


where  vST  =  (vf ,  ...,vfT,  ...V%)T  and  vLT  =  (vf, ...,  vf,  ...vf)7  are 
vectors  whose  elements  are  the  means  of  the  square  roots  of  the 
short-term,  Eq.  (99),  and  long-term,  Eq.  (100), 35  wind  speeds 
calculated  with  the  observed  data  of  each  of  the  JW  stations 
considered.  Qj  =  (qu,  q2_t, .  >s  the  transpose  of  the 

column  vector,  z=t,  of  the  inverse  of  the  square  matrix  R  (JW  x  JW), 
whose  elements  are  defined  by  Eq.  (101  ).36  In  Eq.  (101),  ddkz  and 
rkz  represent  the  distance  and  the  correlation  coefficient,  respec¬ 
tively,  between  the  station  k  and  the  station  z.  0<;/<l  and  y>0  are 
two  parameters  of  the  equation  that  represents  the  relationship 
between  the  data  rkz  and  dkz : 
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rK  z  = 


V  exp (~rddkz) 
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(k±z) 
(k  =  z)  ’ 


!=  1...JW 


(101) 


is  an  estimator  that  does  not  take  into  consideration  the 
data  from  the  other  sites,  while  vf  is  an  estimator  that  does  make 
use  of  these  data.  However,  it  should  be  pointed  out  that  the 
technique  used  by  the  authors  only  gives  an  estimation  of  the 
overall  speed  measure  for  the  target  site.  That  is,  the  method  does 
not  offer  predictions  for  the  different  periods  in  which  the  data 
were  measured. 

Eq.  (98)  is  based  on  the  assumption  that  Eq.  (101)  and  the 
spatial  covariance  structure  [180,182]  hold,  and  on  the  assumption 
that  no  time  dependence  exists.  Given  that  in  this  approach  there 
are  many  underlying  kriging  procedures,  vf  is  called  the  kriging 
estimator.  The  authors  assume  that  vf  follows  a  normal  law  and 
they  formulate  an  expression  for  the  variance  of  this  estimator. 

Denison  et  al.  [184]  propose  a  method  for  spatial  data  analysis 
with  the  aim  of  predicting  the  wind  speed  at  a  new  site  for  which 
only  a  short-term  period  of  data  is  available.  The  authors  use  a 
completely  non-linear  model,  specifically  the  BMARS  (Bayesian 


34  The  distributions  of  the  wind  speeds  at  the  stations  analyzed  by  the  authors 
were  significantly  asymmetric.  So,  they  applied  a  square  root  transformation  with 
the  aim  of  stabilizing  the  variance  and  making  the  distribution  approximately 
normal,  as  also  performed  in  other  studies  [183]. 

35  Except  for  the  target  site,  where  the  long-term  mean  is  here  considered  zero. 

36  The  equation  picks  up  the  fact  that  the  correlation  between  the  data  of  the 
stations  falls  as  the  distance  between  these  stations  rises. 
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multivariate  adaptive  regression  spline),  given  its  ability  to  fit  well 
to  data  in  the  presence  of  irrelevant  predictors  and  because  of  its 
overall  interpretability.  In  their  study,  they  compare  their  method 
with  that  introduced  by  Haslett  and  Raftery  [181],  using  a  dataset 
recorded  in  areas  of  topographical  complexity  in  the  island 
of  Crete. 

Denison  et  al.  [184]  state  that  this  method  is  particularly 
suitable  for  topographies  of  complex  terrain  where,  unlike  the 
data  analysed  by  Haslett  and  Raftery  [181],  no  systematic  relation¬ 
ship  exists  between  the  distance  between  stations  and  the 
correlation.  The  authors  stress  that  the  novelty  of  the  model  that 
they  propose  lies  in  the  fact  that  it  can  be  used  to  give  a  good 
predictive  performance  even  in  the  presence  of  a  small  spatial 
correlation  structure. 

In  recent  years,  data  mining  and  its  applications  have  aroused 
much  interest  [185],  In  this  context,  automatic  learning  techniques 
have  been  proposed  that  enable  the  use  of  two  or  more  reference 
stations.  These  include  techniques  that  employ  statistical  learning 
algorithms,  such  as  Bayesian  networks  (BNs)  [103],  techniques 
based  on  biology-inspired  algorithms,  such  as  Artificial  Neural 
Networks  (ANNs)  [102]  as  well  as  other  alternative  data  mining 
techniques,  including  multivariate  statistical  techniques  [186], 

Carta  et  al.  [103]  propose  using  Bayesian  network  classifiers  to 
estimate  the  long-term  mean  wind  speed  frequency  distribution  at 
a  site  with  few  wind  resource  measurements. 

The  method  they  propose  allows  to  use  multiple  reference 
stations  with  a  long  history  of  wind  speed  and  direction  measure¬ 
ments.  That  is,  the  proposed  method  can  use  regional  information 
of  the  wind  resource.  The  intelligent  system  that  is  used,  the 
knowledge  base  of  which  is  a  joint  probability  function  of  all  the 
variables  of  the  model,  employs  efficient  calculation  techniques  of 
conditioned  probabilities  to  carry  out  its  reasoning.  This  allows  to 
perform  efficiently  the  automatic  learning  of  the  model  and  to 
perform  the  inference  from  the  available  evidence.  The  proposed 
model  was  applied  to  wind  speeds  and  wind  directions  recorded 
at  four  weather  stations  located  in  the  Canary  Islands  (Spain), 
Fig.  21 .  Among  the  conclusions  reached  by  the  authors  of  the  study 
was  that  the  BN  with  three  reference  stations  gave  lower  errors 
between  the  real  and  the  estimated  long-term  mean  wind  turbine 
energy  output  than  those  obtained  with  two  standard  MCP 
algorithms  used  for  comparison  purposes. 

Artificial  Neural  Networks  have  been  proposed  and  applied  in 
renewable  energy  systems  [187],  In  2000,  Addison  et  al.  [188] 
proposed  a  neural  network  version  of  the  MCP  algorithm  for 
estimating  wind  energy  yield.  The  authors  investigated  the  possi¬ 
bility  of  using  neural  networks  to  make  long-term  energy  yield 
predictions  for  a  potential  wind  site.  In  their  study,  they  consid¬ 
ered  the  effectiveness  of  neural  networks  in  the  prediction  of  wind 
speed  at  a  target  site  using  wind  speed  and  direction  measure¬ 
ments  from  just  one  reference  station.  The  authors  compared  this 
technique  with  standard  MCP  algorithms  used  in  the  wind  energy 
industry.  They  concluded  that  with  this  technique  an  improved 
accuracy  of  5-12%  in  the  prediction  was  achievable.  Similar 
proposals  have  subsequently  been  made  in  other  studies  [189]. 

A  large  number  of  the  proposed  models  that  use  various 
reference  stations  use  only  the  recorded  wind  speeds  at  these 
stations  as  the  input  layer  signals.  The  topologies  of  the  ANNs  used 
have  only  a  single  neuron  in  the  output  layer,  with  the  output 
signal  being  the  estimated  wind  speed  at  the  target  site.  In  other 
words,  most  of  the  models  used  ignore  the  influence  that  the  wind 
direction  at  the  reference  sites  might  have  on  the  estimation  of 
target  site  wind  speeds  [190-192].  Lopez  et  al.  [193]  did  study  the 
influence  of  wind  direction,  in  angular  magnitude,  on  wind  speed 
estimation  in  complex  terrain.  However,  of  the  five  anemometer 
stations  used  in  the  study,  the  same  one  was  always  used  as  the 
target  station  and  the  correlation  coefficients  between  stations 


were  of  the  same  order.  Velazquez  et  al.  [102]  presented  the 
results  of  a  study  undertaken  in  the  Canary  Archipelago  (Spain)  in 
which  the  influence  was  investigated  of  various  factors  (number  of 
reference  stations,  degree  of  correlation  between  target  and 
reference  station  wind  speeds,  wind  direction  and  the  manner  in 
which  the  direction  signal  was  introduced  into  the  input  layer  of 
an  ANN)  on  the  estimation  of  wind  turbine  generated  power  at  a 
target  site.  The  authors  concluded  that  when  wind  direction  was 
used  in  angular  magnitude  as  input  signal  of  the  ANNs,  Fig.  22,  the 
metrics  obtained  gave  better  results  than  those  obtained  when 
that  direction  signal  was  omitted.  They  also  report  that  estimation 
errors  tend  to  fall  as  the  number  of  reference  stations  used  rises. 
However,  on  occasions  the  addition  of  some  reference  stations  did 


Fig.  21.  Graph  of  a  Bayesian  network  model  described  in  Ref.  [103],  The  relation¬ 
ships  of  independence/dependence  between  the  variables  (wind  speeds  and 
directions)  which  make  up  the  model  can  be  seen  in  the  graph.  It  is  similarly 
possible  to  construct  the  graph  of  a  model  which  allows  to  estimate  the  target  site 
wind  direction. 


Fig.  22.  Schematic  diagram  of  an  ANN  with  2N  wind  speed  and  angular  wind 
direction  input  signals  of  N  reference  stations  and  two  wind  data  output  signals  of 
the  target  station. 
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not  lead  to  an  improvement  in  the  estimation  error.  According  to 
Faulkner  [48],  though  some  studies  show  that  ANNs  can  give  good 
results,  these  have  not  been  much  better  than  other  methods  and, 
in  general,  they  are  difficult  to  implement. 

Patane  et  al.  [194]  state  that  data  from  anemometer  stations 
with  long  measurement  periods  and  numerical  weather  models 
can  provide  a  large  amount  of  potential  inputs  for  MCP  methods. 
To  process  all  the  available  data,  the  authors  developed  a  multi¬ 
variate  MCP  methodology  (MMCP).  This  technique  efficiently  takes 
into  account  all  the  inputs  and  extracts  the  maximum  amount  of 
relevant  information  at  the  target  site.  Given  that  the  selected 
variables  can  be  highly  intercorrelated  and  that  some  of  them  may 
not  provide  information  about  the  target  site  wind  characteristics, 
the  MMCP  method  is  developed  in  a  two-stage  process.  In  the  first 
stage,  the  input  variables  are  analysed  and  those  that  contain  little 
or  redundant  information  about  the  target  site  wind  character¬ 
istics  are  discarded.  In  the  second  stage,  a  multivariate  regression 
is  performed,  similar  to  the  analysis  of  principal  components, 
based  on  the  cross-correlation  matrix  of  the  selected  variables  and 
the  time  series  of  target  site  wind  speeds  [186],  As  a  result,  an 
extrapolation  of  the  wind  speed  time  series  at  the  target  site  is 
obtained.  According  to  the  authors  [194],  it  can  be  concluded  from 
the  results  of  their  tests  that  the  MMCP  method  was  more  accurate 
than  a  standard  MCP  method  used  for  purposes  of  comparison. 
Their  method  increased  the  quality  of  the  estimation  of  the  long¬ 
term  wind  resource  by  19%  with  respect  to  the  standard  MCP 
method. 


The  matrix  method  [24,105],  as  commented  on  earlier,  also 
allows  to  estimate  the  long-term  wind  direction  by  using  Eq.  (78). 

The  methods  proposed  by  Achberger  [160]  and  Nielsel  et  al. 
[74],  given  that  they  estimate  the  components,  [(Vj)^7  and  [(vj)y]rV  • 
of  the  long-term  wind  speed  at  the  target  site,  can  also  estimate 
the  long-term  wind  direction  through  the  following  equation: 
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(102) 


According  to  Roger  et  al.  [106J,  as  a  conclusion  of  the  studies 
they  made,  the  so-called  “vector”  model  [74]  increases  the  bias 
associated  with  linear  regressions,  but  it  does  predict  relatively 
well  the  wind  direction  distribution,  even  in  sites  with  significant 
wind  direction  deviations  with  respect  to  the  reference  site. 
However,  they  state  that  though  the  vector  method  seems  to  be 
suitable  for  wind  direction  modelling,  more  research  is  needed  to 
find  the  best  wind  direction  prediction  method. 

The  techniques  of  Bayesian  networks  [103]  and  of  ANNs  [102] 
allow  estimation  of  both  long-term  wind  speed  and  direction. 


4.6.  Estimation  methods  of  the  long-term  wind  direction 

It  can  be  seen  from  the  descriptions  of  the  methods  of  wind 
speed  estimation  given  in  the  preceding  sections  that  long-term 
target  site  wind  direction  is  not  commonly  estimated  in  traditional 
MCP  methods.  Generally,  the  long-term  target  site  wind  direction 
is  assumed  to  be  the  same  as  the  wind  direction  recorded  over  a 
year  long  period  at  that  site.  However,  several  studies  have  been 
made  which  do  propose  specific  procedures  for  estimating  the 
long-term  target  site  wind  direction  [24,98,105,107,122]  and  some 
of  the  technologies  proposed  for  wind  speed  estimation  also  allow 
estimation  of  long-term  wind  direction  [74,102-103,160], 

Riedel  et  al.  [107]  propose  using  a  functional  relationship 
between  the  short-term  measured  wind  directions  at  the  refer¬ 
ence  and  target  sites  to  estimate  the  long-term  target  site  wind 
direction.  Their  proposal  involves  eliminating  data  of  the  wind 
direction  series  that  correspond  to  wind  speeds  below  3  m/s  and 
using  Chebyshev  polynomials  [195]  as  a  function  for  predicting  the 
wind  direction.  Similarly,  Derrick  [122]  proposes  fitting  a  poly¬ 
nomial  to  the  wind  direction  data  of  the  concurrent  measuring 
period.  Subsequently,  this  polynomial  has  to  be  applied  to  the 
long-term  wind  direction  distribution  of  the  reference  station  to 
obtain  a  prediction  of  the  long-term  target  site  wind  direction 
distribution.  Derrick  [122]  proposes  elimination  of  data  of  the 
wind  direction  series  below  4  m/s.  According  to  the  author,  the 
‘meandering  behaviour’  of  the  vanes  of  the  anemometers  with  low 
wind  speeds  may  completely  conceal  the  trends  of  wind  directions 
for  high  wind  speeds. 

The  SpeedSort,  DynaSort  and  Scatter  methods,  proposed  by 
King  and  Hurley  [98],  perform  veer  analysis  for  each  data  of  the 
time  series  in  the  concurrent  data  period.  Veer  is  defined  as  the 
difference  between  the  wind  direction  at  the  target  and  reference 
sites.  Average  veer  for  a  given  direction  sector  is  determined  as  the 
mean  of  all  the  hourly  veer  values  in  that  sector.  The  sector-based 
veers  are  applied  to  the  long-term  reference  site  data  to  generate  a 
wind  direction  frequency  table  for  the  target  site.  According  to  the 
authors,  Scatter  was  the  best  method  for  estimating  the  long-term 
wind  rose  shapes. 


5.  Uncertainties  in  results  of  MCP  analyses 

As  mentioned  in  Introduction  section  of  this  review,  since  the 
economic  viability  of  a  wind  farm  depends  fundamentally  on  its 
energy  production,  tools  like  the  MCP  methods  are  of  vital 
importance  for  estimating  the  long-term  wind  resource  at  wind 
farm  candidate  sites  [2-9].  Nonetheless,  wind  resource  estimation 
is  only  useful  if  the  uncertainty  associated  with  the  estimation  is 
properly  understood  by  the  agents  involved.  In  other  words  by  the 
investors,  land  owners,  credit  institutions,  etc. 

There  are  several  sources  of  uncertainty  [15,46,51-52]  and 
among  them  is  found  the  uncertainty  associated  with  the  estima¬ 
tion  of  long-term  wind  characteristics.  To  analyse  the  set  of 
uncertainties,  use  has  been  made  of  deterministic  and  stochastic 
methods,  such  as  the  Monte  Carlo  method  [46], 

To  assess  the  uncertainty  of  the  predictions  made,  users  of  MCP 
methods  only  have  the  measured  wind  data  of  the  concurrent 
period  at  both  reference  and  target  sites  and  the  long-term  data 
recorded  at  the  reference  site.  This  means  that  there  is  effectively 
little  information  available  to  determine  the  uncertainty.  In  addi¬ 
tion,  as  pointed  out  by  Bass  et  al.  [104],  the  uncertainty  related  to 
the  long-term  predictions  of  MCP  methods  depends  much  more 
on  the  data  than  on  the  actual  method  used.  That  is,  though  some 
methods  may  turn  out  to  be  better  than  others  at  estimating  the 
long-term  wind  characteristics  of  a  particular  target  site,  to  a  large 
degree  the  uncertainty  depends  on  the  extent  to  which  the 
conditions  imposed  by  these  methods  to  ensure  their  effectiveness 
(described  in  Section  2)  are  met.  In  other  words,  uncertainty 
depends  to  a  large  extent  on  compliance  with  wind  measurement 
protocols,  on  the  similarity  of  the  reference  and  target  site 
climates,  on  the  knowledge  of  seasonal  and  daily  wind  variations 
and  on  climate  stability.  Within  the  framework  of  these  restric¬ 
tions,  various  statistical  models  have  been  proposed  to  assess  the 
uncertainty  of  the  prediction  of  MCP  methods.  According  to 
Faulkner  [48],  no  robust  or  universal  method  has  yet  been  found 
to  quantify  this  uncertainty.  However,  there  are  some  rough  guides 
and  rules  of  thumb  that  can  be  used.  Some  of  these  are  provided 
in  the  studies  that  have  been  carried  out  on  MCP  methods 
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[47-48,50,104].  In  general,  they  conclude  that  the  longer  the 
concurrent  data  period  and  the  more  extensive  the  long-term 
data  time  series  for  the  reference  site,  the  lower  will  be  the 
uncertainty  of  results  obtained  with  MCP  methods.  According  to 
Faulkner  [48],  some  studies  have  found  a  vague  relationship 
between  the  correlation  coefficient  and  uncertainty,  but  this 
relationship  is  neither  consistent  nor  conclusive.  Bass  et  al.  [104] 
report  that  they  detected  a  clear  relationship  between  uncertainty 
and  the  differences  in  climate  type  or  terrain  complexity. 

Faulkner  [48]  believes  that  a  key  question  is  whether  the 
relationship  that  is  established  in  the  concurrent  data  period  is 
representative  of  the  long-term  relationship.  However,  he  also 
notes  that  there  is  no  direct  way  of  determining  this  and,  there¬ 
fore,  there  is  no  direct  way  of  quantifying  the  uncertainty.  Despite 
this,  all  the  statistical  models  used  to  assess  uncertainty  have  used 
measures  of  the  variability  in  the  relationship  between  the  data  of 
the  reference  and  target  sites  during  the  concurrent  data  period  to 
predict  the  variability  of  the  predictions. 

Derrick  [17,122]  uses  the  variances  and  covariances  of  the 
slope,  p,  and  the  offset,  a,  to  estimate  the  standard  deviation  of 
the  long-term  wind  speeds  predicted  with  a  linear  regression 
model  applied  to  different  reference  site  wind  direction  sectors. 
However,  according  to  Rogers  et  al.  [47],  this  method  under¬ 
estimates  the  overall  uncertainty  and  erroneously  assumes  serial 
independence  as  this  is  not  the  case  for  wind  speeds.  According  to 
these  authors,  such  an  assumption  may  significantly  affect  the 
results  and  furthermore,  they  state,  this  approach  can  only  be 
applied  to  MCP  linear  regression  models. 

There  are  various  strategies  that  can  be  used  to  evaluate  the 
level  of  accuracy  of  the  models.  One  approach  consists  of  using  the 
full  set  of  data  from  the  concurrent  period  to  train  the  model  and 
assess  its  quality.  However,  this  strategy  favours  those  models 
which  overfit  the  training  dataset  and  do  not  generalise  for  other 
data.  Rogers  et  al.  [47]  use  the  jackknife  estimate  of  variance  [170], 
while  Carta  et  al.  [103]  and  Carta  and  Velazquez  [141]  employ 
cross-validation  [170],  In  [103,141],  the  authors  use  10-fold  cross 
validation  with  the  aim  of  using  the  variance  of  the  10  partial 
sampling  errors  to  estimate  the  variability  of  the  learning  method 
with  respect  to  the  evidence. 


6.  Conclusions 

This  paper  offers  a  general  review  of  a  wide  collection  of  MCP 
methods  proposed  for  use  in  wind  energy  analysis.  This  review 
includes  the  initial  methods  proposed  in  the  1940s  which  gen¬ 
erally  attempted  only  to  estimate  the  long-term  mean  annual 
wind  speed  from  a  single  reference  station,  and  extends  up  to  the 
present  day  and  the  most  recent  methods  based  on  automatic 
learning  techniques  that  consider  several  reference  stations  and 
which  are  not  yet  commonly  used  in  the  wind  energy  industry. 
This  review  covers  more  than  150  presentations  at  international 
wind  energy  congresses,  papers  published  in  IF  journals  in  the 
field  of  renewable  energies,  commercial  software  containing  MCP 
modules,  PhD  and  Master's  theses  obtained  from  a  variety  of 
universities,  books  published  by  prestigious  publishing  houses  as 
well  as  research  studies  undertaken  in  this  field.  The  reading  and 
examination  of  the  above  works  has  enabled  the  authors  of  the 
this  paper  to  present  not  only  a  description  of  the  linear,  non¬ 
linear  and  probabilistic  transfer  functions  used  by  the  different 
algorithms,  but  also  the  hypotheses  on  which  they  are  based  and 
the  format  in  which  they  work  (time  series  or  frequency  distribu¬ 
tions).  Also  commented  upon  are  the  restrictions  in  the  use  of  the 
MCP  models,  the  uncertainty  associated  with  them  as  well  as  the 
different  reference  data  sources  that  have  been  used.  All  of  which 
amounts  to  an  interesting  catalogue  of  MCP  methods  which  we 


trust  will  be  of  use  for  renewable  energy  researchers,  particularly 
in  the  field  of  wind  energy. 
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